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(54) Title: TARGET SPECIFIC SCREENING ANDITS USE FOR IDENTIFYING TARGET BINDERS 

(57) Abstract: The present invention relates to a modified phage display method for detecting and identifying target and target 
binders. The modified methods involve transforming host cells with two separate phages, one comprising a target or target library and 
the other comprising a target-binder library, and selecting to eliminate the non-paired targets and binders. Also related are complexes 
comprising target and target binders, as well as antibodies that bind to these complexes. Additionally, the present invention also 
relates to nucleic acids encoding target and target binders peptides. The invention also relates to the use of the sequence information 
inherent in the targets and target binders for target validation using in silico approaches. In addition, the invention also relates to 
diagnostics and therapeutics employing the disclosed target and target binder pepetides or polynucleotides and their use in small 
molecule drug discovery. In particular, the present invention relates to diagnostic and therapeutic applications directed to cancers 
and similar disorders and the isolation of small molecule drugs by using the target and target binders to generate site directed assays 
for high throughput drug screening. 
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TARGET SPECIFIC SCREENING AND ITS USE FOR IDENTIFYING 

TARGET BINDERS 

[0001] This application claims priority to Provisional Application Serial No. 

60/345,471 filed October 24, 2001, hereby incorporated by reference. 

FIELD OF THE INVENTION 
[0002] This invention is in the general field of functional genomics through 

identification of biological molecules, including nucleic acids, protein, and the like, 
more particularly, the identification of targets and their specific binders. 

BACKGROUND OF THE INVENTION 
[0003] A concerted international effort has lead to the deciphering of the 

human genome. Originally, it was estimated that greater than 100,000 genes would 
be encoded within the genome. However, research indicated that the >3.2 billion 
DNA units comprising the genome "only" coded for 30,000-50,000 genes (Human 
Genome Sequencing Consortium and Celera Genomics). Despite rapid progress in 
identifying genes, progress in identifying the activity and function of the gene 
products lags significantly behind. There is additional evidence that the actual 
number of proteins may reach 100,000-200,000 when one considers splice variants 
and post-translational modifications. This indicates that the actual number of 
possible protein targets for drug discovery is much greater than would be anticipated 
from the sequencing of the human genome. Also, there are potential non-protein 
targets for drug discovery including nucleic acids (e.g., mRNA and promoter regions 
of disease related genes) and lipids (e.g., members of the phophoinositol family of 
secondary messages). Thus, while sequencing of the human genome was a great leap 
forward for modern science, it is only the first step in determining the relationships 
between the majority of these genes and disease and their subsequent use as validated 
targets for drug discovery. 
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[0004] Many gene products function by binding to one or more other peptides 

or proteins. Presently, there are few approaches for identifying a protein's binder, 
e.g., the protein with which the target gene product directly interacts. This 
information is critical as proteimligand interactions are involved in important cell 
processes such as signaling (e.g., information transfer in signaling cascades) and 
molecular processing. The identification of protein binders can be used to determine, 
for example, the ligand for a receptor, the substrate for an enzyme, and the regulatory 
protein for an enzyme complex. 

[0005] Binder information is critical to developing a target-binding assay for 

the identification of drug leads. There is a large number of binding assays that exist, 
including in vitro and in vivo formats. Most in vitro formats require input of both 
target and ligand or target binder, and only a few formats require only target input. 
However, formats that require only a protein target generate a high frequency of false 
positives, i.e., compounds that bind but do not cause a change in target activity. Such 
formats would require extensive screening to identify ligands for previously 
uncharacterized protein targets. 

[0006] At present, there are several approaches to finding and validating new 

drug discovery targets. For example, functional genomics involves the use of 
differential techniques (such as microarrays) to discern differences between normal 
and disease-related genes. A subset of this approach uses computational techniques 
to "mine" the public and private databases for differentially expressed genes. For 
example, tyrosine kinases, such as the epidermal growth factor receptor (EGFR) have 
been shown to be involved in a wide variety of cancers such as breast, prostate and 
colorectal carcinoma. In addition, others have used "bait and prey" techniques to 
identify natural partners and validate targets for drug discovery. 

[0007] In the classical approach to binder identification, the protein target and 

its natural partner or target binder are isolated in a complex. In a modern approach, 
the target and natural partner or target binder are constructed as two fusion proteins 
that generate a signal upon interaction. As examples of the latter, yeast two-hybrid 
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systems have been developed. The original intent of the yeast two-hybrid was to 
define the interactions between two proteins in a simple high throughput manner. 
The system utilizes two fusion elements consisting of a DNA-binding domain or bait, 
and a transcriptional activation domain or prey. These two chimeras could then be 
introduced into yeast cells with a reporter. Binding of bait to prey leads to the 
activation of the reporter with the appropriate readout such as growth in defined 
medium. Other two-hybrid like approaches include the bacterial two hybrid system 
(U.S. Patent 5,925,523 to Dove et al.). However, two-hybrid systems have several 
disadvantages, including high levels of false positives, incompatibility with certain 
targets (e.g., RNAs and membrane bound proteins cannot be used), and problems 
with postradiational modifications. Moreover, approaches based on two-hybrid 
systems are not easily applied to a large number of genes of unknown function. 

[0008] Phage display is a useful approach as a selection technique in which a 

peptide or protein is expressed as a fusion with a coat protein of a bacteriophage 
(phage), resulting in display of the fused protein on the exterior surface of the phage 
virion. Briefly, phage display has been used to create a physical linkage between a 
large library of random peptide sequences to the DNA encoding each sequence, 
allowing for rapid identification of peptide ligands for a variety of target molecules, 
such as for example, antibodies, enzymes, cell-surface receptors, etc., by an in vitro 
selection process called biopanning (Parmley, S.F. and Smith, G.P. (1988) Gene 73, 
305-318; Reviewed in Cortese, R. et al. (1995) Curr. Opin. Biotechnol. 6, 73-80; 
Noren, CJ. (1996) NEB Transcript 8 (1), 1-5). 

[00091 Briefly, biopanning is performed by incubating a library of phage- 

displayed peptides with a target, removing unbound phage, and eluting the 
specifically bound phage. However, a purified target is necessary in the preferred use 
of this methodology. The purification of active target is a cumbersome step made 
much more difficult when the protein under investigation does not yet have a known 
function needed to monitor the production of active protein. As such methodology 
does not afford the investigator a means for identifying large populations of unknown 
proteins such as those found to be differentially expressed in one cell versus another, 
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or under some disease related condition at a time. Furthermore, when using one 
target at a time, the present approach requires that once the eluted phage is amplified, 
several cycles, usually 3 to 4 rounds, of biopanning and amplification is essential for 
successfully enriching the phage pool of tightly binding sequences. 

[0010] Identifying novel disease-related targets, disease related populations 

of proteins, and their use in high throughput drug discovery is highly desirable for 
pharmaceutical and biotechnology companies in the post-genomic era. The 
traditional process of drug discovery relies on only a limited number of targets that 
could be screened using small chemical or natural product libraries. With the advent 
of biotechnology however, recombinant proteins and monoclonal antibodies became 
available as drugs to treat various diseases. In general, the use of these reagents had a 
solid experimental base prior to their use. For example, erythropoietin (EPO) is a 
growth factor involved in the regeneration of red blood cells by activating its cognate 
receptor. Knowledge of the EPO/EPO receptor nexus allowed the search for the 
ligand to proceed and eventually succeed. Similarly, monoclonal antibodies such as 
Herceptin (anti-erb B2) and C225 (anti-EGFR) are based on a body of experimental 
data dating to the identification of these receptors and their relationship to cancer. 

[0011] Thus, there is a need for a screening system that ameliorates or 

overcomes one or more of the above or other encountered problems. An ideal system 
would allow the sampling of very large numbers of specificities of entire populations. 
These populations, i.e., proteomes, could, for example, contain protein members that 
are differentially expressed on one cell versus another. An ideal system would allow 
for the sampling of populations of targets having >106 members and populations of 
potential target binders having >101 1 members. An ideal system would also allow 
for rapid sorting during a cloning round and rapid transfer of the genetic material 
coding for the binding molecule from one stage of the production process, to the next 
stage. Therefore, a rapid and unencumbered selection method for identifying and 
isolating novel disease-related targets and their use in high throughput drug discovery 
is highly desirable. 
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SUMMARY OF THE INVENTION 
[0012] This invention provides a method for identifying molecules, including 

nucleic acids, polypeptides, fragments thereof, and the like, that bind to a target of 
interest. 

[0013] The invention provides a method for identifying a biological molecule 

of interest or target, including nucleic acids, polypeptides, fragmnets thereof, and the 
like, and their corresponding target binders using a modified phage display method. 
The method involves mixing in a reaction vessel two separate phage, one comprising 
a target or target library and the other comprising a target-binder library each having 
different selection markers, forming mated or complexed pairs, transforming the 
mated or paired complexes into host cells, followed by selection to eliminate the non- 
paired targets and binders. After selection, the target and/or the target binders can be 
identified by sequencing the DNA inserts in the target and target-binder phage DNA 
associated with the selected pairing of target and target binder. This embodiment of 
the invention also allows for simultaneous screening for a single biological molecule 
or target, or a multiplicity of biological molecules and their target binders. 

[0014] This invention can be used to identify target binders that are capable 

of binding a specific target having known or unknown function. The target binders 
may be identified from phage libraries or cDNA libraries that express, for example, a 
fully randomized peptide library or a partially randomized peptide library on the 
surface or exterior of the phage. Further, the invention can be used to identify natural 
binding partners from a cDNA library for a particular peptide or protein. Moreover, 
the invention may also be used to identify target binders found within the same or 
between different cDNA libraries. 

[0015] This invention also relates to obtaining target binders that can be used 

in assays for screening the binding of a target and other target binders. The identified 
target binders may be used in competition assays with other molecules to screen 
target binding. The other molecules can be any type of molecule, for example, 
peptides, proteins, carbohydrates, lipids or small organic compounds. 



WO 03/035839 PCT/US02/34021 

-6- 



[0016] This invention also relates to targets and target binders isolated by the 

methods of the invention that may be used as tools for further analysis leading to the 
treatment of diseases. One example of the present invention relates to the target 
DGI-2 and DGI-2 binders isolated by the methods of the present invention, which 
may be used as tools for further analysis leading to the treatment of diseases. 

[0017] In another embodiment of the invention, the library from which target 

binders are identified according to their ability to bind to the target, is a random 
peptide phage display library, where peptides range in length from about 9 to about 
100 amino acids, and preferably 20 to 40 amino acids. The library from which the 
target binders are identified is a peptide phage display library, where only certain 
amino acids within the peptide are selectively randomized. Also the phage display 
library may express a cDNA library. In this embodiment, it is possible to identify a 
natural binding partner of the target. In a further embodiment of the invention, both 
the target library and the target-binder library are cDNA libraries, which expresses 
the amino acid sequences encoded by the cDNA library on the surface or exterior of 
the phage. 

[0018] It is a further object of the invention to provide target binders having 

biological activity (e.g., agonists, antagonists, inhibitors, or other modulators) that 
bind to target polypeptides, variants, or fragments thereof using a target-binder phage 
library and a target phage construct. 

[0019] It is another obj ect of the invention to provide a method for 

identifying an amino acid sequence motif that confers binding properties to a target 
by screening a library of expressed amino acid sequences for binding to the target, 
determining the amino acid sequence of the members of the library which bind to the 
target, and identifying as motifs common amino acid sequences. 

[0020] It is yet another object of the invention to provide compositions (e.g., 

pharmaceutical compositions) comprising one or more targets and/or target binders 
polynucleotides, polypeptides, peptides, or antibodies. 



WO 03/035839 



-7- 



PCT/US02/34021 



[0021] It is a further object of this invention to provide target peptide binders 

to the protein encoded by the target gene, wherein the target peptide binders are those 
having SEQ ID NOs: 1-189 and SEQ ID NOs: 227-532. 

[0022] Additional obj ects and advantages afforded by the present invention 

will be apparent from the detailed description and exemplification herein below. 

DESCRIPTION OF THE FIGURES 
[0023] The appended drawings of the figures are presented to further describe 

the invention and to assist in its understanding through clarification of its various 
aspects. In the figures of the present invention, the nucleotide and amino acid 
sequences are represented by their one-letter abbreviations. 

[0024] FIG. 1 shows potential binding motifs of DGI-2 binders. "PP" 

identifies the peptides as having been isolated by the proteome panning technique 
described below. The other peptides were isolated using standard panning 
approaches as described in Pillutla et al. (BMC Biotechnology 1:6, 2001). The 
number shown in subscript following each peptide name represents the number of 
times the peptide was isolated .from the DGI-2 clones. For example, DGI2-20R-4- 
H814 was isolated 14 times after 4 rounds of conventional panning using a random 
20-mer peptide phage display library. Etag - peptide binding to E tag antibody 
(AAAGAPVPYPDPLEPRP; SEQ ID NO: 190) fused to peptide; DGI-2 - binding to 
DGI-2 polypeptide (specific binding); LDH - peptide binding to LDH polypeptide 
(non-specific binding); DGI-2/LDH - ratio of specific binding to non-specific 
binding. Values shown represent ELISA units read at 405 nm. Higher values 
indicated higher binding specificity for target. Q represents a CAG stop; * represents 
a TGA stop; # represents a TAA stop; + represents any polar amino acid (D, E, K, R, 
H, N, and Q); and § represents any hydrophobic amino acid (A, F, G, I, L, M, T, V, 
and W). 
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[0025] FIGS . 2 A-2B show a schematic diagram of the preparation of the 

cDNA libraries in the modified tetracycline-resistant phage and the use of cDNA 
libraries in one or both of the phage libaries. 

[0026] FIG. 3 shows the DNA sequence of the DGI-2, which was cloned into 

the tetracycline resistant vector, pCANTAB(tet). 

[0027] FIG. 4 shows sequences from 070902-DGI2-20M-PP-BC Proteome 

Pan. Amino acid sequences are represented by their one-letter abbreviations. 
Binding is indicated as the ratio over background, Specific/ Irrelevant (Sp/Irr). LDH 
= Lactate dehydrogenase (control irrelevant protein). 

[0028] FIGS. 5A-5B show sequences from 070902-Hras-20M-PP-BC 

Proteome Pan. Amino acid sequences are represented by their one-letter 
abbreviations. Binding is indicated as the ratio over background (Sp/Irr). LDH = 
Lactate dehydrogenase (control irrelevant protein). 

[0029] FIG. 6 shows sequences from 070902-Leptin-20M-PP-BC Proteome 

Pan. Amino acid sequences are represented by their one-letter abbreviations. 
Binding is indicated as the ratio over background (Sp/Irr). LDH = Lactate 
dehydrogenase (control irrelevant protein). 

[0030] FIGS. 7A-7K show results of a database query showing only Homo 

sapiens (human) hits transferred into Excel. The Access database can be easily 
queried for specific values (or value range) in all fields. Results can be easily 
transferred or exported. Separate tables containing results from different panning 
experiments can also be queried together for cross reference. 

[0031] FIGS. 8A-8G show a database of human genes and the sequences of 

binding peptides that were identified by panning protocols using libraries, described 
herein (DGI Biotechnologies). The database also provides identification numbers to 
related information such as the gene, protein, and locus. 

[0032] FIGS. 9A-9B show the sequences of DGI-5 Binders. Amino acid 

sequences are represented by their one-letter abbreviations. Binding is indicated as 
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the ratio over background. Insulin growth factor receptor (IGFR) was used as the 
control irrelevant protein. Numbers in subscript at the end of a clone name indicate 
the number of times the sequence was found in that panning. 

[0033] FIG. 10 shows sequences of DGI-7 Binders. Amino acid sequences 

are represented by their one-letter abbreviations. Binding is indicated as the ratio 
over background. LDH = Lactate dehydrogenase (control irrelevant protein). 
Numbers in subscript at the end of a clone name indicate the number of times the 
sequence was found in that panning. 

[0034] FIG. 1 1 shows sequences of Vascular Endothelial Growth Factor 

(VEGF) Binders. Amino acid sequences are represented by their one-letter 
abbreviations. Binding is indicated as the ratio over background. BSA was used as 
the irrelevant protein because target protein was resuspended in PBS + BSA. 
Numbers in subscript at the end of a clone name indicate the number of times the 
sequence was found in that panning. 

[0035] FIGS. 12A-12B show sequences of Vascular Endothelial Growth 

Factor- Receptor 1/ FMS-related Tyrosine Kinase 1 (VEGF-R1 / FLT1) Binders. 
Amino acid sequences are represented by their one-letter abbreviations. Binding is 
indicated as the ratio over background. BSA was used as the irrelevant protein 
because target protein was resuspended in PBS + BSA. Numbers in subscript at the 
end of a clone name indicate the number of times the sequence was found in that 
panning. 

[0036] FIG. 1 3 shows sequences of Vascular Endothelial Growth Factor- 

Receptor 2 / FLK1 / Kinase insert Domain protein Receptor (VEGF-R2 / FLK1 / 
KDR) Binders. Amino acid sequences are represented by their one-letter 
abbreviations. Binding is indicated as the ratio over background. BSA was used as 
the irrelevant protein because target protein was resuspended in PBS + BSA. 
Numbers in subscript at the end of a clone name indicate the number of times the 
sequence was found in that panning. 
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[0037] FIG. 14 shows sequences of Vascular Endothelial Growth Factor 

Receptor 3 (VEGF-R3) Binders. Amino acid sequences are represented by their one- 
letter abbreviations. Binding is indicated as the ratio over background. BSA was 
used as the irrelevant protein because target protein was resuspended in PBS + BSA. 
Numbers in subscript at the end of a clone name indicate the number of times the 
sequence was found in that panning. 

[0038] FIG. 1 5 shows sequences of Epithelial Growth Factor Receptor 

(EGFR) Binders. Amino acid sequences are represented by their one-letter 
abbreviations. Binding is indicated as the ratio over background. LDH = Lactate 
dehydrogenase (control irrelevant protein). Numbers in subscript at the end of a 
clone name indicate the number of times the sequence was found in that panning. 
Two separate panning experiments are indicated by the subscript alpha or beta. 

[0039] FIGS . 1 6 A- 1 6B show sequences of Fibroblast Growth Factor 

Receptor 1 (FGFR1) alpha and FGFR1 beta Binders. Amino acid sequences are 
represented by their one-letter abbreviations. Binding is indicated as the ratio over 
background. BSA was used as the irrelevant protein because target protein was 
resuspended in PBS + BSA. Numbers in subscript at the end of a clone name 
indicate the number of times the sequence was found in that panning. 

[0040] FIGS. 17A-17B show sequences of Tie- 1 Binders. Amino acid 

sequences are represented by their one-letter abbreviations. Binding is indicated as 
the ratio over background. BSA was used as the irrelevant protein because target 
protein was resuspended in PBS + BSA. Numbers in subscript at the end of a clone 
name indicate the number of times the sequence was found in that panning. 

DETAILED DESCRIPTION OF THE INVENTION 
[0041] The present invention relates to methods for detecting and identifying 

known and unknown targets, and their target binders. The present invention involves 
using phage display libraries to identify target binders for known or unknown targets. 
The phage display libraries may be peptide libraries or cDNA libraries that express 
the peptides or encoded amino acid sequence on the surface or exterior of the phage. 
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The method involves combining two separate phage to allow binding of the 
expressed target and target binders and then transforming host cells with the 
combination. The members of specific binding pairs are then selected by using 
multiple selection markers. The present invention also relates to the binding 
molecules produced by these methods. 

Definitions 

[0042] The term "target" as used herein refers to a biological molecule of 

interest, including nucleic acids, proteins (intracellular, transmembrane, 
extracellular), amino acids, polypeptides, fragments thereof, and the like. Targets 
may be, for example, receptors, enzymes, binding proteins, antibodies or 
polypeptides of known or unknown function. For example, the polypeptide DGI-2, 
amino acids, cDNAs, or peptides are targets of the invention. 

[0043] The term "target binder" as used herein refers to a molecule that binds 

to a target and can include nucleic acids, amino acids, polypeptides, fragments 
thereof, and the like. Target binders also include but are not limited to ligands, small 
molecules, and drugs. 

[0044] The term "vector" as used herein refers to a nucleic acid molecule 

capable of replicating another nucleic acid to which it has been linked. A vector, for 
example, can be a plasmid. 

[0045] "Host cells" includes prokaryotes and eukaryotes. The term includes 

an organism or cell that is the recipient of a phage or phage library. 

[0046] "Isolated", as used herein, refers to nucleic or amino acid sequences 

that are at least 60% free, preferably 75% free, and most preferably 90% free from 
other components with which they are naturally associated. 

[0047] As used herein, the terms "protein" and "polypeptide" are 

synonymous. "Peptides" are defined as fragments or portions of polypeptides, 
preferably fragments or portions having at least one functional activity (e.g., binding, 
signaling, or antigenic activity) as the complete polypeptide sequence. 
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[0048] The term "antigenic" refers to the ability of a molecule (e.g., a 

polypeptide or peptide) to bind to its specific antibody, or an antibody fragment, with 
sufficiently high affinity to form a detectable antigen-antibody complex. 

[0049] The term "ligand" as used herein describes any molecule, protein, 

peptide, or compound with the capability of directly or indirectly altering the 
physiological function, stability, or levels of a polypeptide or polynucleotide. In 
specific aspects, ligands may directly bind to a polypeptide or polynucleotide. 
Ligands may include nucleic acids, carbohydrates, or small organic compounds. 

[0050] A "sample" as used herein refers to a biological sample, such as, for 

example, tissue or fluid isolated from an individual (including, without limitation, 
plasma, serum, cerebrospinal fluid, lymph, tears, saliva, milk, pus, and tissue 
exudates and secretions) or from in vitro cell culture constituents, as well as samples 
obtained from, for example, a laboratory procedure. 

[0051] General descriptions of the foregoing terms and others are known in 

the art. See, e.g., Roitt et al., 1989, Immunology, 2 nd Edition, C.V. Mosby Company, 
New York; Male et al., 1991, Advanced Immunology, 2 nd Edition, Grower Medical 
Publishing, New York. 

Proteome Panning Method 

[0052] This invention provides a method of selecting and identifying target 

and target binders, by transforming host cells with two separate phages, one 
comprising a target-binder library and the other comprising a target or target library, 
followed by selection for the pairing of the target and target binder, where the general 
method is termed herein "proteome panning". 

[0053] In general, the present invention involves the following steps: 

[0054] 1 . Preparing a phage comprising a target and a selection marker, 

where the target may be an individual target or a mixture of known or unknown 
targets from, for example, cDNA or subtracted cDNA libraries. The target may be a 
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full-length protein, parts of proteins or domains involved in biological interactions. 
The target may be of known or unknown function; 

[0055] 2. Obtaining a phage target-binder library and a second selection 

marker, where the phage may express peptides, full-length proteins, parts of proteins, 
or domains involved in biological interactions. The peptides or proteins may be fully 
or partially randomized or may be from cDNA library; 

[0056] 3 . Panning, where the target phage and the target-binder phage 

are mixed in solution and targets and target-binders are allowed to mate or form 
complexed pairs. The process may be performed for 2-24 hours, preferably 2-3 
hours, at a temperature range of 28°C - 37°C, preferably at a temperature range of 
28°C - 30°C. 

[0057] 4. Transforming host cells with the mixture of target phage and 

target-binder phage library, wherein the mated or complexed pairs infect the host 
cells; 

[0058] 5 . Selecting for host cells infected by phage that formed 

complexes of target and target binders by using various selection agents including, 
but not limited to, antibiotics such as ampicillin and tetracycline, auxotrophic 
markers, and fluorescent markers. 

[0059] 6. Analyzing target and target binders by sequencing and using 

computational approaches. 

[0060] In a preferred embodiment of the invention, the target may be an 

expressed peptide or protein or a single expressed cDNA. In another preferred 
embodiment of the invention, the target could be a target library comprising a cDNA 
phage library. The cDNAs would preferably range in size from about 200 base pairs 
to 2500 base pairs, more preferably 350-1500 base pairs. The target-binder library is 
preferably a fully randomized peptide phage display library comprising peptides 
ranging in length from about 9 amino acids to 100 amino acids, more preferably 20- 
40 amino acids. In another embodiment of the invention, the target-binder library 
may have only particular amino acids selectively randomized or other constraints 
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built into the its sequence. Non-limiting examples of targets for target binders 
include: DGI-2, DGI-5, DGI-7, DGI-9, VEGF, VEGFR1, VEGFR2, VEGFR3, 
EGFR, FGFRl-alplu^eta, and Tie-1. The target protein, in one embodiment, is 
DGI-2. DGI-2 was identified as an uncharacterized gene in a public database 
designated KIAA0101 (GenBank Acc. No. XM_042258). The nucleotide sequence 
enclosing DGI-2 (SEQ ID NO: 533) is shown in Figure 3. The target proteins for 
DGI-5, DGI-7 (metallothionein-1 1), and DGI-9 (HEC) were identified in public 
databases having the GenBank Acc. No. Accession numbers AA7073 1 9, X767 1 7, 
and NM_006101, respectively. 

[0061] In another embodiment of the invention, techniques may be used for 

enriching only the high affinity binders to the targets. These may include, but are not 
limited to, pre-incubation time, changes in temperature, pH, ionic strength, 
incubation in the presence of competing target binders and washing steps with 
different buffers. Other techniques may also include a second or multiple rounds of 
proteomic panning which includes amplification steps, where the population of 
selected colonies are amplified, rescued and repanned. 

[0062] In a further embodiment of the invention, the method involves 

proteome panning of known or unknown gene protein products or other targets 
expressed on phage with phage displayed libraries of random peptides or cDNA, and 
obtaining a set of peptides or cDNAs which bind to such targets in such a manner as 
to allow the identification of individual pairs of targets and target binders. Bacteria 
are isolated, either as a population or as single colonies containing a single pairing of 
one target and one target binder. The target and target binder are then sequenced, 
thereby providing additional functional information. When initially using a single 
target phage, displaying a target on the surface of the phage, multiple target binders 
are obtained, for example, in the order of tens to thousands. The targets and target 
binders are sequenced and the target binders used individually or as consensus motifs 
to search for genes, for example, with expressed proteins of matching amino acid 
sequence. These identified consensus motifs may also be used to design selectively 
randomized library of target binders for second proteome pannings. Figure 1 shows 
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four potential binding motifs of DGI-2 binders: Motif 1: GCXXFXGFCV (SEQ ID 
NO: 505); Motif 2: FRXWVEG<|> (SEQ ID NO:512); Motif 3: SXGWXFPGWR 
(SEQ ID NO: 517); and Motif 4: 4>WR±WV±GSLxG (SEQ ID NO: 522). Sequences 
having at least one of these consensus motifs are also shown in Figure 1 (SEQ ID 
NOs: 505-532). 

Phage Display Libraries 

[00631 Phage display libraries can be screened for target binders that bind to 

targets using proteome panning as described above. Details of the construction and 
analyses of these libraries are described in detail herein and have been published (see, 
e.g., WO 96/04557; Mandecki et al., 1997, Display Technologies - Novel Targets 
and Strategies, P. Guttry (ed), International Business Communications, Inc. 
Southborogh, MA, pp. 231-254; Ravera et al., 1998, Oncogene 16:1993-1999; Scott 
and Smith, 1990, Science 249:386-390); Grihalde et al., 1995, Gene 166:187-195; 
Chen et al., 1996, Proc. Natl Acad. ScL USA 93:1997-2001; Kay et al., 1993, Gene 
128:59-65; Carcamo et al., 1998, Proc. Natl. Acad. Sci. USA 95:11146-11151; 
Hoogenboom, 1997, Trends Biotechnol. 15:62-70; Rader and Barbas, 1997, Curr. 
Opin. Biotechnol. 8:503-508; all of which are incorporated herein by reference). 

[0064] In one aspect of the invention, target cDNA phage libraries may be 

used. Of the numerous methods for constructing cDNA libraries, the approach based 
on methods described in Gene, 25:263 (1983) is the most widely used. Approaches 
to achieve enrichment of specific sequences include subtractive cDNA libraries 
{Trends Genet., 9:70 (1993)) and normalized cDNA libraries {Genome Res., 6:791 
(1996)). While subtractive and normalized libraries are generally not full-length 
cDNAs, the frequency of fiill-length cDNAs has been increased by RecA-mediated 
triple-strand formation in a subtractive cDNA library {Nucleic Acid Res., 24:3478 
(1996). An advantage of the invention is that only very small amounts of the protein 
targets are needed, i.e., amounts sufficient for their presentation on phage coat 
proteins, where gene III is preferably present in 3-5 copies per phage. By using a 
natural process of protein generation, e.g. bacterial, to make a protein of unknown 
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function and characteristics, large percentages of protein populations from cDNA 
libraries can be produced at one time for investigation. 

[0065] The present invention may also use peptide libraries to identify new 

binders for the target of interest. Peptide libraries can be designed according to 
methods described in detail herein, and methods generally available to those in the art 
(see, e.g., U.S. Patent No. 5,723,286 issued March 3, 1998 to Dower et al.). In one 
aspect, commercially available phage display libraries can be used (e.g., RAPIDLIB® 
or GRABLIB®, DGI BioTechnologies, Inc., Edison, NJ; Ph.D. C7C Disulfide 
Constrained Peptide Library, New England Biolabs). 

[0066] In another aspect, an oligonucleotide or cDNA library can be prepared 

according to methods known in the art, and inserted into an appropriate vector for 
peptide expression. In order to construct cDNA phage display libraries, commonly 
known techniques in the art are employed. More particularly, three methods for 
producing cDNA that are used in the invention include: 1) random priming and 
directional cloning; 2) PCR-based oligo dT priming and directional cloning; and 3) 
excising cDNA from existing libraries. For example, ligation-ready subtracted or 
unsubtracted cDNA is produced by classical cDNA methods using random priming 
and poly A+ RNA from various normal and cancerous tissues (see Example 14). The 
PCR-based oligo dT priming and directional cloning method may use a modified 
Smart cDNA Synthesis kit (Clontech) as described in Example 14. The third method 
essentially involves excising cDNA from pre-existing cDNA libraries comprising 
normal and cancerous tissues at the appropriate restriction enzyme sites (see Example 
14). Both cDNA and subtracted cDNA phage display libraries may be commercially 
obtained or constructed. Briefly, poly A+ RNA from various tissues may be 
purchased or purified from cells lines. This RNA produces double stranded cDNA of 
a desirable size that may be cloned into the phage display vector to make the cell 
library. In particular, the preferred cDNA size for insertion into the phage display 
vector is 100 base pairs to 3000 base pairs, and more preferably 200 base pairs to 
2000 base pairs. Alternatively, cDNA may be excised from pre-existing libraries. 
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Poly A+ RNA or cDNA from cancerous tissue subtracted in multiple rounds from 
excess normal poly A+ RNA or cDNA generates subtracted libraries. 

[0067] For the phage display system, N-terminal display on gene III may be 

preferably chosen for its favorable high affinity interactions due to the low valency of 
gene III on the phage particle. However, cDNA containing stop codons eliminates 
gene III expression and hence there is no phage display of the recombinant protein. 
In order to eliminate clones with stop codons, the cDNA is first ligated to either of 
two expression screen vectors, one containing the green fluorescent protein (GFP) 
gene and the other containing the kanamycin (kan) gene. In the kanamycin screen, 
three reading frames are employed to increase the probability of in frame clones. 
Fluorescent or kanamycin resistant (kanR) clones are picked, pooled, and amplified 
by cell growth. A large-scale DNA preparation is carried out, digested with 
appropriate enzymes, and cloned into the phage display vector to construct the cell 
library containing all open reading frames. 

[0068] For example, vectors encoding a bacteriophage structural protein, 

preferably an accessible phage protein, such as a bacteriophage coat protein, can be 
used. Although one skilled in the art will appreciate that a variety of bacteriophage 
may be employed in the present invention, in preferred embodiments the vector is, or 
is derived from, a filamentous bacteriophage, such as, for example, fl, fd, Pfl, Ml 3, 
etc. In particular, the fd-tet vector has been extensively described in the literature 
(see, e.g., Zacher et al., 1980, Gene 9:127-140; Smith et al., 1985, Science 228:1315- 
1317; Parmley and Smith, 1988, Gene 73:305-318). 

[0069] The phage vector is chosen to contain or is constructed to contain a 

cloning site located in the 5* region of the gene encoding the bacteriophage structural 
protein, so that the peptide is accessible to receptors in an affinity enrichment 
procedure as described herein below. The structural phage protein is preferably a 
coat protein. An example of an appropriate coat protein is pill. A suitable vector 
may allow oriented cloning of the cDNA or the oligonucleotide sequences so that the 
protein or peptide is expressed at or within a distance of about 100 amino acid 



WO 03/035839 



- 18- 



PCT/US02/34021 



residues of the N-terminus of the mature coat protein. The coat protein is typically 
expressed as a preprotein, having a leader sequence. 

[0070] Another feature of the design of these libraries enables "Rapid Partner 

Identification" by high throughput sequencing. The rapidity of partner identification 
is dependent on the ability to sequence, individually, each of the two clones within a 
single colony which have identical vector sequences except for the respective 
antibiotic resistance gene (e.g. ampicillin or tetracycline). To accomplish this, a 
single PCR product must be obtained when sequence is desired for the clone 
originating from either the ampicillin (amp) resistant (ampR) or tetracycline (tet) 
resistant (tetR) phage libraries. To achieve this end, minor changes were 
incorporated in vector sequences with corresponding specific primers which 
differentiate between the tet and amp library clones. Given the analysis that the 
human genome contains 30,000 to 66,000 genes, multiple libraries are expression 
screened and cloned into the phage display vector until the combined diversity is 1 x 
10 s expression positive clones, i.e. open reading frames (ORFs), for the unsubtracted 
libraries. The cell libraries are rescued and the resultant phage libraries, either amp 
or tet resistant are combined and used in all phases of proteome panning. 

[0071] For generating subtractive cDNA phage display libraries, subtractive 

hybridization is utilized. Subtractive hybridization is a powerful technique that 
enables one to compare two populations of mRNA and obtain clones of genes that 
are expressed in one population, but not in the other. Briefly, both mRNA 
populations are converted into cDNA. The target and target binder cDNAs are 
hybridized and the hybrids sequences are removed. Consequently, the remaining 
unhybridized cDNAs represent genes that are expressed in the target comparede to 
the target binder, (see Example 15). 

[0072] Thus, the oligonucleotide or cDNA library is desirably inserted so that 

the N-terminus of the processed bacteriophage outer protein is the first residue of the 
peptide or protein, i.e., between the 3'-terminus of the sequence encoding the leader 
protein and the 5 f -terminus of the sequence encoding the mature protein or a portion 
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of the 5' terminus. A cDNA library or oligonucleotide library is constructed by 
cloning the cDNA or an oligonucleotide which contains the variable region of library 
members (and any spacers, if necessary) into the selected cloning site. Using known 
recombinant DNA techniques (see generally, Sambrook et al., 1989, Molecular 
Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y., 1989), an oligonucleotide may be constructed which, inter alia, 
1) removes unwanted restriction sites and adds desired ones; 2) reconstructs the 
correct portions of any sequences which have been removed (such as a correct signal 
peptidase site, for example); 3) inserts spacer residues, if any; and/or 4) corrects the 
translation frame (if necessary) to produce active, infective phage. 

[0073] The central portion of the oligonucleotide will generally contain one 

or more target binding sequences, such as for example, DGI-2-binding sequences 
and, optionally, spacer sequences. The sequences are ultimately expressed as 
peptides (with or without spacers) fused to or in the N-terminus of the mature coat 
protein on the outer, accessible surface of the assembled bacteriophage particles. The 
size of the library will vary according to the number of variable codons, and hence 
the size of the peptides, which are desired. Generally, the library will be at least 
about 10 6 members, usually at least 10 7 , and typically 10 8 or more members. To 
generate the collection of oligonucleotides which forms a series of codons encoding a 
random collection of amino acids and which is ultimately cloned into the vector, a 
codon motif is used, such as (NNK) X , where N may be A, C, G, or T (nominally 
equimolar), K is G or T (nominally equimolar), and x is typically up to about 5, 6, 7, 
8, or more, thereby producing libraries of penta-, hexa-, hepta-, and octa-peptides or 
larger. The third position may also be G or C, designated "S". Thus, NNK or NNS 
1) code for all the amino acids; 2) code for only one stop codon; and 3) reduce the 
range of codon bias from 6:1 to 3:1. 

[0074] It should be understood that, with longer peptides, the size of 

the library that is generated may become a constraint in the cloning process. The 
expression of peptides from randomly generated mixtures of oligonucleotides in 
appropriate recombinant vectors is known in the art (see, e.g., Oliphant et al., Gene 
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44:177-183). For example, the codon motif (NNK) 6 produces 32 codons, one for 
each of 12 amino acids, two for each of five amino acids, three for each of three 
amino acids and one (amber) stop codon. Although this motif produces a codon 
distribution as equitable as available with standard methods of oligonucleotide 
synthesis, it results in a bias against peptides containing one-codon residues. In 
particular, a complete collection of hexacodons contains one sequence encoding each 
peptide made up of only one-codon amino acids, but contains 729 (3 6 ) sequences 
encoding each peptide with only three-codon amino acids. 

[0075] An alternative approach to minimize the bias against one-codon 

residues involves the synthesis of 20 activated trinucleotides, each representing the 
codon for one of the 20 genetically encoded amino acids. These are synthesized by 
conventional means, removed from the support while maintaining the base and 5- 
OH-protecting groups, and activated by the addition of 3'O-phosphoramidite (and 
phosphate protection with b-cyanoethyl groups) by the method used for the activation 
of mononucleosides (see, generally, McBride and Caruthers, 1983, Tetrahedron 
Letters 22:245). Degenerate oligocodons are prepared using these trimers as building 
blocks. The trimers are mixed at the desired molar ratios and installed in the 
synthesizer. The ratios will usually be approximately equimolar, but may be a 
controlled unequal ratio to obtain the over- to under-representation of certain amino 
acids coded for by the degenerate oligonucleotide collection. The condensation of 
the trimers to form the oligocodons is done essentially as described for conventional 
synthesis employing activated mononucleosides as building blocks (see, e.g., 
Atkinson and Smith, 1984, Oligonucleotide Synthesis, M.J. Gait, Ed., p. 35-82). This 
procedure generates a population of oligonucleotides for cloning that is capable of 
encoding an equal distribution (or a controlled unequal distribution) of the possible 
peptide sequences. Advantageously, this approach may be employed in generating 
longer peptide sequences, since the range of bias produced by the (NNK) 6 motif 
increases by three-fold with each additional amino acid residue. 

[0076] When the codon motif is (NNK) X , as defined above, and when x 

equals 8, there are 2.6 x 10 10 possible octa-peptides. A library containing most of the 
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octa-peptides may be difficult to produce. Thus, a sampling of the octa-peptides may 
be accomplished by constructing a subset library using up to about 10% of the 
possible sequences, which subset of recombinant bacteriophage particles is then 
screened. If desired, to extend the diversity of a subset library, the recovered phage 
subset may be subjected to mutagenesis and then subjected to subsequent rounds of 
screening. This mutagenesis step may be accomplished in two general ways: the 
variable region of the recovered phage may be mutagenized, or additional variable 
amino acids may be added to the regions adjoining the initial variable sequences. 

[0077] To diversify around active peptides (i.e., binders) found in proteome 

panning or early rounds of conventional biopanning, the selected phage can be 
sequenced to determine the identity of the active peptides. Oligonucleotides can then 
be synthesized based on these peptide sequences. The syntheses are performed with 
a low level of all bases incorporated at each step to produce slight variations of the 
primary oligonucleotide sequences. This mixture of degenerate or slightly 
degenerate oligonucleotides may then be cloned into the affinity phage by methods 
known to those in the art. This method produces systematic, controlled variations of 
the starting peptide sequences as part of a secondary library. It requires, however, 
that individual positive phage be sequenced before mutagenesis, and thus is useful for 
expanding the diversity of small numbers of recovered phage. 

[0078] An alternate approach to diversify the selected phage allows the 

mutagenesis of a pool, or subset, of recovered phage. In accordance with this 
approach, phage recovered from panning are pooled and single stranded DNA is 
isolated. The DNA is mutagenized by treatment with, e.g., nitrous acid, formic acid, 
or hydrazine. These treatments produce a variety of damages to the DNA. The 
damaged DNA is then copied with reverse transcriptase, which misincorporates bases 
when it encounters a site of damage. The segment containing the sequence encoding 
the receptor-binding peptide is then isolated by cutting with restriction nuclease(s) 
specific for sites flanking the peptide coding sequence. This mutagenized segment is 
then recloned into undamaged vector DNA, the DNA is transformed into cells, and a 
secondary library according to known methods. General mutagenesis methods are 
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known in the art (see Myers et al., 1985, Nucl. Acids Res. 13:3131-3145; Myers et al., 
1985, Science 229:242-246; Myers, 1989, Current Protocols in Molecular Biology 
Vol. /, 8.3.1-8.3.6, F. Ausubel et al., eds, J. Wiley and Sons, New York). 

[00791 In another general approach, the addition of amino acids to a peptide 

or peptides found to be active, can be carried out using various methods. In one, the 
sequences of peptides selected in proteome panning or early conventional panning 
are determined individually and new oligonucleotides, incorporating the determined 
sequence and an adjoining degenerate sequence, are synthesized. These are then 
cloned to produce a secondary library. Alternatively, methods can be used to add a 
second target -binding sequence to a pool of peptide-bearing phage. In accordance 
with one method, a restriction site is installed next to the first target-binding 
sequence. Preferably, the enzyme should cut outside of its recognition sequence. 
The recognition site may be placed several bases from the first binding sequence. To 
insert a second target-binding sequence, the pool of phage DNA is digested and 
blunt-ended by filling in the overhang with Klenow fragment. Double-stranded, 
blunt-ended, degenerately synthesized oligonucleotides are then ligated into this site 
to produce a second binding sequence juxtaposed to the first binding sequence. This 
secondary library is then amplified and screened as before. 

[0080] While in some instances it may be appropriate to synthesize longer 

peptides to bind certain receptors, in other cases it may be desirable to provide 
peptides having two or more target-binding sequences separated by spacer (e.g., 
linker) residues. For example, the binding sequences may be separated by spacers 
that allow the regions of the peptides to be presented to the receptor in different ways. 
The distance between binding regions may be as little as 1 residue, or at least 2-20 
residues, or up to at least 100 residues. Preferred spacers are 3, 6, 9, 12, 15, or 18 
residues in length. For probing large binding sites or tandem binding sites, the 
binding regions may be separated by a spacer of residues of up to 20 to 30 amino 
acids. The number of spacer residues when present will typically be at least 2 
residues, and often will be less than 20 residues. 
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[0081] The oligonucleotide library may have binding sequences which are 

separated by spacers (e.g., linkers), and thus may be represented by the formula: 
(NNK)y - (abc) n - (NNK) 2 where N and K are as defined previously (note that S as 
defined previously may be substituted for K), and y+z is equal to about 5, 6, 7, 8, or 
more, a, b and c represent the same or different nucleotides comprising a codon 
encoding spacer amino acids, n is up to about 3, 6, 9, or 12 amino acids, or more. 
The spacer residues may be somewhat flexible, comprising oligo-glycine, or oligo- 
glycine-glycine-serine, for example, to provide the diversity domains of the library 
with the ability to interact with sites in a large binding site relatively unconstrained 
by attachment to the phage protein. Rigid spacers, such as, e.g., oligo-proline, may 
also be inserted separately or in combination with other spacers, including glycine 
spacers. It may be desired to have the target-binding sequences close to one another 
and use a spacer to orient the binding sequences with respect to each other, such as 
by employing a turn between the two sequences, as might be provided by a spacer of 
the sequence glycine-proline-glycine, for example. To add stability to such a turn, it 
may be desirable or necessary to add cysteine residues at either or both ends of each 
variable region. The cysteine residues would then form disulfide bridges to hold the 
variable regions together in a loop, and in this fashion may also serve to mimic a 
cyclic peptide. Of course, those skilled in the art will appreciate that various other 
types of covalent linkages for cyclization may also be used. 

[0082] Spacer residues as described above may also be situated on either or 

both ends of the target-binding sequences. For instance, a cyclic peptide may be 
designed without an intervening spacer, by having a cysteine residue on both ends of 
the peptide. As described above, flexible spacers, e.g., oligo-glycine, may facilitate 
interaction of the peptide with the selected receptors. Alternatively, rigid spacers 
may allow the peptide to be presented as if on the end of a rigid arm, where the 
number of residues, e.g., proline residues, determines not only the length of the arm 
but also the direction for the arm in which the peptide is oriented. Hydrophilic 
spacers, made up of charged and/or uncharged hydrophilic amino acids, (e.g., Thr, 
His, Asn, Gin, Arg, Glu, Asp, Met, Lys, etc.), or hydrophobic spacers of hydrophobic 



WO 03/035839 



-24- 



PCT/US02/34021 



amino acids (e.g., Phe, Leu, He, Gly, Val, Ala, etc.) may be used to present the 
peptides to receptor binding sites with a variety of local environments. 

[0083] Notably, some peptides, because of their size and/or sequence, may 

cause severe defects in the infectivity of their carrier phage. To minimize problems 
associated with defective infectivity, DNA prepared from the eluted phage can be 
transformed into appropriate host cells, such as, e.g., E. coli, preferably by 
electroporation (see, e.g., Dower et al., Nucl Acids Res. 16:6127-6145), or well 
known chemical means. The cells are cultivated for a period of time sufficient for 
marker expression, and selection is applied as typically done for DNA 
transformation. The colonies are amplified, and phage harvested for affinity 
enrichment in accordance with established methods. Phage identified in the affinity 
enrichment may be re-amplified by infection into the host cells. The successful 
transformants are selected by growth in an appropriate antibiotic(s), e.g., tetracycline 
or ampicillin. This may be done on solid or in liquid growth medium. 

[0084] For growth on solid medium, the cells are grown at a high density 

(about 108 to 109 transformants per m2) on a large surface of, for example, L-agar 
containing the selective antibiotic to form essentially a confluent lawn. The cells and 
extruded phage are scraped from the surface and phage are prepared for proteome 
panning. For growth in liquid culture, cells may be grown in media and antibiotic 
through multiple doublings as necessary. The phage are harvested by standard 
procedures (see Sambrook et al., 1989, Molecular Cloning, 2nd ed.). Growth in 
liquid culture may be more convenient because of the size of the libraries, while 
growth on solid media likely provides less chance of bias during the amplification 
process. 

Affinity Enrichment with Proteome Panning 

[0085] For affinity enrichment of desired clones of target binders, the 

preincubation conditions, i.e. the binding conditions prior to transformation of the 
host cells, are varied to optimize the recovery of target binders with a higher affinity. 
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Phage that associate with a target via non-specific interactions are removed by 
affinity enrichment. 

[0086] In one example of affinity enrichment, the ratio of target phage to 

target-binder phage is optimized. Generally, about 10 1 to 10 9 library equivalents are 
incubated with phage expressing a target, or portion thereof, to which the desired 
target binder is sought. A library equivalent is equal to one member of each 
recombinant, e.g., 10 4 equivalents of a library of 10 9 members is 10 9 x 10 4 = 10 13 
phage. Preferably, affinity enrichment is accomplished by incubating the target with 
about 10 2 library equivalents up to about 10 6 library equivalents and more preferably 
about 10 3 to 10 4 library equivalents. By varying the ratio between target phage and 
target-binder phage, particularly with the number of target binders exceeding the 
target, the recovery of target binders with higher affinity can be enhanced. When the 
number of target binders exceed the target, the better binders are able to outcompete 
and eliminate the poorer binders in the selection. For example, by adjusting the ratio 
between number of target binders to number of target the number of target binders 
selected, i.e. the population of surviving pairs, can be reduced to less than 300 
members of target binders with better binding affinity. 

[0087] In another example, affinity enrichment is accomplished by adjusting 

the preincubation time prior to transformation of the host cells. The preincubation 
time may range from 2 hours to 24 hours, preferably between 2 hrs and 3 hrs. The 
population of selected target binders is enriched with target binders having higher 
affinities by increasing the preincubation time. The preincubation time should be 
increased so that the two phage are allowed to reach equilibrium or steady state. In 
other examples, affinity enrichment is improved by varying the stringency of the 
preincubation conditions. For example temperature, ionic strength, volume and pH 
are adjusted to enrich the selected population with better target binders and eliminate 
or reduce the number of weaker target binders. The temper ature may range from 
room temperature or 28°C to 42°C, preferably 30°C-37°C, and most preferably 30°C. 
The preferred pH value would be neutral. Varying the temperature, pH, ionic 
strength, divalent cation concentration, and the volume and duration of the 



WO 03/035839 



-26- 



PCT/US02/34021 



preincubation time will select for target binders within particular ranges of affinity 
for the receptor. 

[0088] Additional examples of affinity enrichment are based on slow 

dissociation rates of target binders. A slow dissociation rate is usually predictive of 
high affinity. In these examples of affinity enrichment, the continued incubation of 
the target phage and the target-binder phage is performed in the presence of a 
saturating amount of a known target binder or by increasing the volume of the 
incubation solution. In each case, the rebinding of dissociated target-binder phage is 
prevented, and with increasing time, target-binder phages of higher affinity are 
recovered. 

[0089] The preincubation time and the preincubation conditions are 

optimized for each target/target-binder of interest. To monitor the effect of the 
varying conditions on affinity enrichment pilot experiments of proteome panning are 
performed. After incubation of the target phage and the target-binder phage and 
transformation of the host cells, the host cells are plated out onto selective media and 
quantified. Determining the change in the number of colonies that survive provides 
an easy assessment tool to determine the degree of affinity enrichment. As the 
number of surviving colonies declines, the number of surviving weak binders is 
significantly diminished, leaving fewer target binders with higher affinity. For 
example, the loss of the number of surviving colonies, until only 1%, 0.1%, or 
0.001% survive, indicates optimal conditions for enriching target binders that bind 
the target having higher affinity. In some circumstance, the number of surviving 
colonies could be limited to about 100 colonies for analysis by sequencing. 
Depending on the diversity of the type of target binder library used, the number of 
target binders with a higher affinity may by less than 10. 

[0090] The use of the above affinity-enrichment techniques allows for 

enrichment without necessarily performing additional rounds of panning. The 
affinity-enrichment techniques can be used alone or in combination. It is to be 
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understood that the present invention could also use multiple rounds of proteome 
panning to provide for affinity enrichment if desired. 

Peptides And Polypeptides 

[0091] A further aspect of the present invention pertains to isolated peptides 

such as binders of DGI-2, DGI-5, DGI-7, DGI-9, VEGF, VEGFR1, VEGFR2, 
VEGFR3, EGFR, FGFR1 -alpha, FGFRl-beta, and Tiel (e.g., SEQ ID NOs: 1-189; 
SEQ ID NOs: 227-532). DGI-9 target binder sequences are shown in Table 3. The 
amino acid sequences are represented by their one-letter abbreviations. Binding is 
indicated as the ratio over background. LDH is the control irrelevant protein. 
Numbers in subscript at the end of a clone name indicate the number of times the 
sequence was found in that panning. The peptides of this invention can be isolated, 
synthetic, or recombinant. The target binders may be obtained as individual peptides 
or part of a polypeptide complex. In various aspects, a polypeptide-complex may 
comprise one or more molecules of target in an association with one or more target 
binders (e.g., multiple copies of the same peptide binder or single copies of different 
peptide binders). 

[0092] A further embodiment of the invention is a method of identifying an 

amino acid sequence motif which confers binding properties to a target by screening 
a library of expressed amino acid sequences for binding to the target, determining the 
amino acid sequence of the members of the library which bind to the target, and 
identifying as motifs common amino acid sequences. 

[0093] Yet another embodiment of the invention relates to amino acid 

sequence motifs which bind to the specific target. In addition to binding to the target, 
the amino acid sequences may also possess biological activity, not limited to agonist, 
partial agonist or antagonist activity at the target. 

[0094] The present invention encompasses the target, target binders, and 

fragments and functional equivalents thereof. The term "functional equivalent" is 
intended to include proteins which differ in amino acid sequence from the target or 
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target binder peptides but where such differences result in a modified protein which 
performs at least one characteristic function of the polypeptide (e.g., binding, 
signaling, or antigenic activity). For example, a functional equivalent of a 
polypeptide may have a modification such as a substitution, addition, or deletion of 
an amino acid residue that is not directly involved in the function of this polypeptide. 

[0095J It is also possible to vary the structure of a target or target binder 

peptide for such purposes as increasing solubility, enhancing activity, antigenicity, or 
stability (e.g., shelf life ex vivo and resistance to proteolytic degradation in vivo). 
Such variants are considered functional equivalents of the polypeptides and peptides 
as defined herein. Preferably, polypeptides and peptides are modified so that they 
retain activity. Those residues shown to be essential for activity can be modified by 
replacing the essential amino acid with another, preferably similar amino acid residue 
(a conservative substitution) whose presence is shown to enhance, diminish, but not 
eliminate, or not effect receptor interaction. In addition, those amino acid residues 
that are not essential for binding or other activity can be modified by being replaced 
by another amino acid whose incorporation may enhance, diminish, or not effect 
reactivity. 

[0096] Polypeptide and peptide variants include mutants differing by the 

addition, deletion, or substitution of one or more amino acid residues. Also included 
are modified polypeptides and peptides in which one or more residues are modified, 
and mutants comprising one or more modified residues. Useful modifications may 
include phosphorylation, sulfation, reduction/alkylation (Tarr, 1986, Methods of 
Protein Microcharacterization, J. E. Silver, Ed., Humana Press, Clifton, NJ, pp. 155- 
194); acylation (Tarr, supra); chemical coupling (Mishell and Shiigi (Eds), 1980, 
Selected Methods in Cellular Immunology, W H Freeman, San Francisco, CA; U.S. 
Patent No. 4,939,239); and mild formalin treatment (Marsh, 1971, Int. Arch, of 
Allergy and Appl. Immunol 41:199-215). Additionally, D-amino acids, non-natural 
amino acids, or non-amino acid analogs can be substituted or added to produce a 
modified polypeptide. Furthermore, the polypeptides disclosed herein can be 
modified using polyethylene glycol (PEG) according to known methods (S.L Wie et 



WO 03/035839 



-29- 



PCT/US02/34021 



al., 1981, Int. Arch. Allergy Appl. Immunol. 64(l):84-99) to produce a protein 
conjugated with PEG. In addition, PEG can be added during chemical synthesis of 
the protein. Modifications or sequence variations may occur at the amino- or 
carboxy-terminal positions of the reference polypeptide sequence or anywhere 
between those terminal positions, interspersed either individually among the amino 
acids in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

[00971 Polypeptides or peptides may also be modified with a label capable of 

providing a detectable signal, either directly or indirectly, including, but not limited 
to, radioisotope, fluorescent, and enzyme labels. Fluorescent labels include, for 
example, Cy™3, Cy™5, Alexa, BODIPY, fluorescein (e.g., FluorX, DTAF, and 
FITC), rhodamine (e.g., TRITC), auramine, Texas Red, AMCA blue, and Lucifer 
Yellow. Preferred isotope labels include 3 H, 14 C, 32 P, 35 S, 36 CI, 5J Cr, 57 Co, 58 Co, 
59 Fe ^ 90 Y? 125 ^ i3i I? md i86 Re p re f erre( i enzyme labels include peroxidase, p- 

glucuronidase, (3-D-glucosidase, p-D-galactosidase, urease, glucose oxidase plus 
peroxidase, and alkaline phosphatase (see, e.g., U.S. Pat. Nos. 3,654,090; 3,850,752 
and 4,016,043). Enzymes can be conjugated by reaction with bridging molecules 
such as carbodiimides, diisocyanates, glutaraldehyde, and the like. Enzyme labels 
can be detected visually, or measured by calorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric, or gasometric techniques. Other labeling 
systems, such as avidin/biotin, Tyramide Signal Amplification (TSA™), are known 
in the art, and are commercially available (see, e.g., ABC kit, Vector Laboratories, 
Inc., Burlingame, CA; NEN® Life Science Products, Inc., Boston, MA). 

[0098] Polypeptide and peptide variants may have "conservative" changes, 

wherein a substituted amino acid has similar structural or chemical properties, e.g., 
replacement of leucine with isoleucine. More infrequently, a variant may have 
"nonconservative" changes, e.g., replacement of a glycine with a tryptophan. 
Guidance in determining which amino acid residues can be substituted, inserted, or 
deleted without abolishing biological or immunological activity can be determined 
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using computer programs well known in the art, for example, DNASTAR software 
(DNASTAR, Inc., Madison, WI). 

[0099] Substantial changes in function or immunogenicity can be made by 

selecting substitutions that are less conservative. For example, non-conservative 
substitutions can be made which more significantly affect the structure of the 
polypeptide in the area of the alteration, for example, the alpha-helical, or beta-sheet 
structure; the charge or hydrophobicity of the molecule at the target site; or the bulk 
of the side chain. The substitutions which generally are expected to produce the 
greatest changes in the polypeptide's properties are those where 1) a hydrophilic 
residue, e.g., seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g., 
leucyl, isoleucyl, phenylalanyl, valyl, or alanyl; 2) a cysteine or proline is substituted 
for (or by) any other residue; 3) a residue having an electropositive side chain, e.g., 
lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g., 
glutamyl or aspartyl; or 4) a residue having a bulky side chain, e.g., phenylalanine, is 
substituted for (or by) a residue that does not have a side chain, e.g., glycine. 

[00100] Sequence tags (e.g., FLAG® tags) or amino acids, such as one or 
more lysines, can be added to the peptide sequences of the invention (e.g., at the N- 
terminal or C-terminal ends) as described in detail herein. Sequence tags can be used 
for peptide purification or localization. Lysines can be used to increase peptide 
solubility or to allow for biotinylation. Alternatively, amino acid residues located at 
the carboxy and amino terminal regions of the consensus motifs described below, 
which comprise sequence tags (e.g., FLAG® tags), or which contain amino acid 
residues that are not associated with a strong preference for a particular amino acid, 
may optionally be deleted providing for truncated sequences. Certain amino acids 
(e.g., C-terminal or N-terminal residues) such as lysine which promote the stability or 
biotinylation of the amino acids sequences may be deleted depending on the use of 
the sequence, as for example, expression of the sequence as part of a larger sequence 
which is soluble, or linked to a solid support. 
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[00101] Polypeptide or protein fragments (i.e., peptides) can range in size 
sufficient to contain the minimal number of amino acids to define a binding motif, 
typically from about 5 amino acid residues to all but one residue of the entire amino 
acid sequence. Thus, a peptide can be at least 5, 15, 20, 25, 30, 50, 100, or more 
consecutive amino acid residues of a target or target-binding protein or polypeptide. 
In one embodiment, the percent amino acid sequence identity between a target-binder 
or target-partner polypeptide or peptide, and a functional equivalent thereof is at least 
50%. In a preferred embodiment, the percent amino acid sequence identity is at least 
65%. More preferably, the percent amino acid sequence identity is at least 75%, still 
more preferably, at least 80%, and even more preferably, at least 90%. In one 
embodiment, the percent nucleic acid sequence identity between a target-binder or 
target-partner nucleotide, and a functional equivalent thereof is at least 50%. In a 
preferred embodiment, the percent nucleotide sequence identity is at least 65%. 
More preferably, the percent nucleotide sequence identity is at least 75%, still more 
preferably, at least 80%, and even more preferably, at least 90%. 

[00102] Percent sequence identity can be calculated using computer programs 
or direct sequence comparison. Preferred computer program methods to determine 
identity between two sequences include, but are not limited to, the GCG program 
package, FASTA, BLASTP, and TBLASTN (see, e.g., D.W. Mount, 2001, 
Bioinformatics: Sequence and Genome Analysis, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY). The BLASTP and TBLASTN programs are 
publicly available from NCBI and other sources. The well-known Smith Waterman 
algorithm may also be used to determine identity. 

[00103] Exemplary parameters for amino acid sequence comparison include 
the following: 1) algorithm from Needleman and Wunsch, 1970, J Mol. Biol. 
48:443-453; 2) BLOSSUM62 comparison matrix from Hentikoff and Hentikoff, 
1992, Proc. Natl. Acad. Sci. USA 89:10915-10919; 3) gap penalty = 12; and 4) gap 
length penalty = 4. A program useful with these parameters is publicly available as 
the "gap" program (Genetics Computer Group, Madison, WI). The aforementioned 
parameters are the default parameters for polypeptide comparisons (with no penalty 
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for end gaps). Alternatively, polypeptide sequence identity can be calculated using 
the following equation: % identity = ((the number of identical residues) / (alignment 
length in amino acid residues)) x 100. For this calculation, alignment length includes 
internal gaps but does not include terminal gaps. 

[00104] In accordance with the present invention, polypeptide or peptide 
sequences may be identical to DGI-2 or DGI-2-binder polypeptides or peptides, or 
may include up to a certain integer number of amino acid alterations. Polypeptide 
alterations are selected from the group consisting of at least one amino acid deletion, 
substitution, including conservative and non-conservative substitution, or insertion. 

[00105] The invention also relates to isolated, synthesized and/or recombinant 
portions or fragments of a target or target binder polypeptide as described herein. 
Polypeptide fragments (i.e., peptides) can be made which have full or partial function 
on their own, or which when mixed together (though fully, partially, or nonfunctional 
alone), spontaneously assemble with one or more other polypeptides to reconstitute a 
functional protein having at least one functional characteristic of a target protein of 
this invention. In addition, peptides may comprise, for example, one or more motifs 
of a target binder, disclosed herein. 

[00106] Nucleic acids comprising protein-coding sequences can be used to 
direct the expression polypeptides or peptides in intact cells or in cell-free translation 
systems. The coding sequence can be tailored, if desired, for more efficient 
expression in a given host organism, and can be used to synthesize oligonucleotides 
encoding the desired amino acid sequences. The resulting oligonucleotides can be 
inserted into an appropriate vector and expressed in a compatible host organism or 
translation system. 

[00107] The polypeptides and peptides of the present invention, including 
functional equivalents, may be isolated from wild-type or mutant cells (e.g., human 
cells or cell lines), from heterologous organisms or cells (e.g., bacteria, yeast, insect, 
plant, and mammalian cells), or from cell-free translation systems (e.g., wheat germ, 
microsomal membrane, or bacterial extracts) in which a protein-coding sequence has 
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been introduced and expressed. Furthermore, the polypeptides and peptides may be 
part of recombinant fusion proteins. The polypeptides and peptides can also, 
advantageously, be made by synthetic chemistry. Polypeptides and peptides may be 
chemically synthesized by commercially available automated procedures, including, 
without limitation, exclusive solid phase synthesis, partial solid phase methods, 
fragment condensation or classical solution synthesis, described in detail below. 

[00108] Methods for polypeptide and peptide purification are well-known in 
the art, including, without limitation, preparative disc-gel electrophoresis, isoelectric 
focusing, HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition 
chromatography, and countercurrent distribution. For some purposes, it is preferable 
to produce the polypeptide or peptide in a recombinant system in which the protein 
contains an additional sequence (e.g., epitope or protein) tag that facilitates 
purification. Non-limiting examples of epitope tags include c-myc, haemagglutinin 
(HA), polyhistidine, GLU-GLU, and FLAG® epitope tags. Non-limiting examples 
of protein tags include glutathione-S-transferase (GST), green fluorescent protein 
(GFP), and maltose binding protein (MBP). 

[00109] In one approach, the coding sequence of a polypeptide or peptide can 
be cloned into a vector that creates a fusion with a sequence tag of interest. Suitable 
vectors include, without limitation, pRSET (Invitrogen Corp., San Diego, CA), 
pGEX (Amersham-Pharmacia Biotech, Inc., Piscataway, NJ), pEGFP (CLONTECH 
Laboratories, Inc., Palo Alto, CA), and pMAL™ (New England BioLabs (NEB), 
Inc., Beverly, MA) plasmids. Following expression, the epitope, or protein tagged 
polypeptide or peptide can be purified from a crude lysate of the translation system or 
host cell by chromatography on an appropriate solid-phase matrix. In some cases, it 
may be preferable to remove the epitope or protein tag (i.e., via protease cleavage) 
following purification. As an alternative approach, antibodies produced against a 
protein or peptide can be used as purification reagents. Other purification methods 
are also possible. 
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[00110] Both the naturally occurring and recombinant forms of the 
polypeptides of the invention can advantageously be used to screen compounds for 
binding activity. Many methods of screening for binding activity are known by those 
skilled in the art and may be used to practice the invention. Several methods of 
automated assays have been developed in recent years so as to permit screening of 
tens of thousands of compounds in a short period of time. Such high-throughput 
screening methods are particularly preferred. The use of high-throughput screening 
assays to test for inhibitors is greatly facilitated by the availability of large amounts 
of purified polypeptides and peptides, as provided by the invention. The 
polypeptides and peptides of the invention also find use as therapeutic agents as well 
as antigenic components to prepare antibodies. 

[00111] Many conventional techniques in molecular biology, protein 
biochemistry, and immunology may be used to produce the amino acid sequences for 
use with this invention. To obtain recombinant polypeptides and peptides, coding 
sequences may be cloned into any suitable vectors for expression in intact cells or in 
cell-free translation systems by methods well known in the art (see Sambrook et al., 
1989) and described herein. The particular choice of the vector, cell, or translation 
system is not critical to the practice of the invention. 

[00112] According to methods known in the art, polypeptides and peptides can 
be chemically synthesized by commercially available automated procedures, 
including, without limitation, exclusive solid phase synthesis, partial solid phase 
methods, fragment condensation, classical solution synthesis. In addition, 
recombinant and synthetic methods of peptide production can be combined to 
produce semi-synthetic polypeptides and peptides. The polypeptides and peptides of 
the invention are preferably prepared by solid phase peptide synthesis methods and 
techniques as described by Merrifield, 1963, J. Am. Chem. Soc. 85:2149; J.M. 
Stewart and J.D. Young, 1984, Solid Phase Peptide Synthesis, 2nd edition, Pierce 
Chemical Co., Rockford, IL; W.C. Chan and P.D. White (Eds.), 2000, Fmoc Solid 
Phase Peptide Synthesis: A Practical Approach, Oxford University Press; Kaiser et 
al., 1970, Anal. Biochem. 34:595. 
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[00113] This invention provides specific amino acid sequences that bind DGI- 
2. However, additional sequences may be obtained in accordance with the 
procedures described herein below. 

Antibodies 

[00114] Another aspect of the invention pertains to antibodies directed to the 
target of interest, such as target, target-binders (e.g., SEQ ID NOs: 1-189; SEQ ID 
NOs: 227-532), or fragments or variants thereof. Specifically included are antibodies 
directed to polypeptide-complexes of target and target binders, or fragments of these 
complexes. For example, antibodies directed to polypeptide-complexes of DGI- 
2/DGI-2-binders are complex-specific, i.e., the antibodies do not bind to the 
components separately. The invention provides polyclonal and monoclonal 
antibodies that bind to target, target-binders, or target-partners, fragments, or 
complexes thereof. The antibodies may be elicited in an animal host (e.g., rabbit, 
goat, mouse, or other non-human mammal) by immunization with disorder- 
associated immunogenic components. Antibodies may also be elicited by in vitro 
immunization (sensitization) of immune cells. The immunogenic components used 
to elicit the production of antibodies may be isolated from cells or chemically 
synthesized. The antibodies may also be produced in recombinant systems 
programmed with appropriate antibody-encoding DNA. Alternatively, the antibodies 
may be constructed by biochemical reconstitution of purified heavy and light chains. 
The antibodies include hybrid antibodies, chimeric antibodies, and univalent 
antibodies. Also included are Fab fragments, including Fabl and Fab(ab)2 fragments 
of antibodies. 

[00115] In a further example, an isolated DGI-2, DGI-2-binder peptide, or 
complex thereof, can be used as an immunogen to generate antibodies using standard 
techniques for polyclonal and monoclonal antibody preparation. Full-length 
polypeptides can be used or, alternatively, the invention provides antigenic peptide 
portions of these polypeptides for use as immunogens. The antigenic peptide of 
DGI-2, DGI-2-binder or a DGI-2-partner comprises a sufficient number of 
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contiguous amino acid residues of the amino acid sequence, or a variant thereof, to 
encompass an epitope of a DGI-2, DGI-2-binder, or DGI-2-partner polypeptide such 
that an antibody raised against the peptide forms a specific immune complex with a 
DGI-2 or DGI-2-binder amino acid sequence. Typically, about 5 contiguous amino 
acids are sufficient to define an epitope. 

[00116] An appropriate immunogenic preparation can contain, for example, 1) 
recombinant^ produced DGI-2 or DGI-2-binder polypeptides or peptides; 2) 
chemically synthesized DGI-2 or DGI-2-binder polypeptides or peptides, 3) 
fragments of these polypeptides (i.e., peptides); or 4) complexes comprising these 
polypeptides. The preparation can further include an adjuvant, such as Freund's 
complete or incomplete adjuvant, or similar immunostimulatory agent. A number of 
adjuvants are known and used by those skilled in the art. Non-limiting examples of 
suitable adjuvants include incomplete Freund's adjuvant, mineral gels such as alum, 
aluminum phosphate, aluminum hydroxide, aluminum silica, and surface-active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanin, and dinitrophenol. Further examples of adjuvants 
include stearyl tyrosine (A. Nixon-George et al., 1990, J. Immunol. 144:4798-4802; 
Paoletti, et al., 1997, J. Infect. Diseases 175:1237-9; U.S. Pat. No. 4,258,029 to 
Moloney et al.; U.S. Pat. No. 5,683,699 to Jennings, et al.), N-acetyl-muramyl-L- 
threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine 
(CGP 11637, referred to as nor-MDP), N-acetylmuramyl-Lalanyl-D-isoglutaminyl-L- 
alanine-2-(l , -2 , -dipalmitoyl-sn-glycero-3 hydroxyphosphoryloxy)-ethylamine (CGP 
19835A, referred to as MTP-PE), and RIBI adjuvant (e.g., Detox®, Corixa Corp., 
Seattle WA) which contains three components extracted from bacteria, 
monophosphoryl lipid A, trehalose dimycolate and cell wall skeleton 
(TDM+CWS+MPL®; Corixa Corp.) in a 2% squalene/Tween 80 emulsion. A 
particularly useful adjuvant comprises 5% (wt/vol) squalene, 2.5% Pluronic L121 
polymer and 0.2% polysorbate in phosphate buffered saline (Kwak et al., 1992, New 
Eng. 7. Med. 327:1209-1215). Preferred adjuvants include complete BCG, Detox, 
(RIBI, Immunochem Research Inc.), ISCOMS, and aluminum hydroxide adjuvant 
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(Superphos, Biosector). The effectiveness of an adjuvant may be determined by 
measuring the amount of antibodies directed against the immunogenic peptide. 

[00117] Polyclonal antibodies to targets and target binders, such as, DGI-2 or 
DGI-2-binder polypeptides, peptides, or polypeptide-complexes thereof, can be 
prepared as described above by immunizing a suitable subject with a DGI-2, DGI-2- 
binder, or DGI-2-partner immunogen. The antibody titer in the immunized subject 
can be monitored over time by standard techniques, such as with an enzyme linked 
immunosorbent assay (ELISA) using immobilized DGI-2 DGI-2-binder polypeptide, 
peptide, or polypeptide-complex. If desired, the antibody molecules can be isolated 
from the mammal (e.g., from the blood) and further purified by well-known 
techniques, such as protein A chromatography to obtain the IgG fraction. 

[00118] At an appropriate time after immunization, e.g., when the antibody 
titers are highest, antibody-producing cells can be obtained from the subject and used 
to prepare monoclonal antibodies by standard techniques, such as the hybridoma 
technique (see Kohler and Milstein, 1975, Nature 256:495-497; Brown et al., 1981, J. 
Immunol. 127:539-46; Brown et al., 1980, J. Biol. Chem. 255:4980-83; Yeh et al., 
1976, PNAS 76:2927-31; and Yeh et al., 1982, Int. J. Cancer 29:269-75), the human 
B cell hybridoma technique (Kozbor et al., 1983, Immunol. Today 4:72), the EBV- 
hybridoma technique (Cole et al., 1985, Monoclonal Antibodies and Cancer Therapy, 
Alan R. Liss, Inc., pp. 77-96) or trioma techniques. 

[00119] The technology for producing hybridomas is well-known (see 
generally R. H. Kenneth, 1980, Monoclonal Antibodies: A New Dimension In 
Biological Analyses, Plenum Publishing Corp., New York, NY; E.A. Lerner, 1981, 
Yale J. Biol. Med., 54:387-402; M.L. Gefter et al., 1977, Somatic Cell Genet. 3:231- 
36). In general, an immortal cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with a target, target-binder, or 
target-partner immunogen as described above, and the culture supernatants of the 
resulting hybridoma cells are screened to identify a hybridoma producing a 
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monoclonal antibody that binds target, target-binder, or target-partner polypeptides, 
peptides, or polypeptide-complexes. 

[00120] Any of the many well known protocols used for fusing lymphocytes 
and immortalized cell lines can be applied for the purpose of generating an 
monoclonal antibody to a target, target-binder, or target-partner, peptide, or 
polypeptide-complex (see, e.g., G. Galfre et al., 1977, Nature 266:55052; Gefter et 
al., 1977; Lemer, 1981; Kenneth, 1980). Moreover, the ordinarily skilled worker will 
appreciate that there are many variations of such methods. Typically, the immortal 
cell line (e.g., a myeloma cell line) is derived from the same mammalian species as 
the lymphocytes. For example, murine hybridomas can be made by fusing 
lymphocytes from a mouse immunized with an immunogenic preparation of the 
present invention with an immortalized mouse cell line. Preferred immortal cell lines 
are mouse myeloma cell lines that are sensitive to culture medium containing 
hypoxanthine, aminopterin, and thymidine (HAT medium). Any of a number of 
myeloma cell lines can be used as a fusion partner according to standard techniques, 
e.g., the P3-NSl/l-Ag4-l, P3-x63-Ag8.653, or Sp2/0-Agl4 myeloma lines. These 
myeloma lines are available from ATCC (American Type Culture Collection, 
Manassas, VA). Typically, HAT-sensitive mouse myeloma cells are fused to mouse 
splenocytes using polyethylene glycol (PEG). Hybridoma cells resulting from the 
fusion arc then selected using HAT medium, which kills unfused and unproductively 
fused myeloma cells (unfused splenocytes die after several days because they are not 
transformed). Hybridoma cells producing a monoclonal antibody of the invention are 
detected by screening the hybridoma culture supernatants for antibodies that bind 
target, target-binder, or target-partners, peptides, or polypeptide-complexes, e.g., 
using a standard ELISA assay. 

[00121] Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal antibody can be identified and isolated by screening a recombinant 
combinatorial immunoglobulin library (e.g., an antibody phage display library) with 
the corresponding target or target-binder, peptide, or polypeptide-complex to thereby 
isolate immunoglobulin library members that bind these molecules. Kits for 
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generating and screening phage display libraries are commercially available (e.g., the 
Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene SurfZAP™ Phage Display Kit, Catalog No. 240612). 

[00122] Additionally, examples of methods and reagents particularly amenable 
for use in generating and screening antibody display library can be found in, for 
example, A. Blume U.S. Patent No. 6,010,861, Ladner et al. U.S. Pat. No. 5,223,409; 
Kang et al. International Publication No. WO 92/18619; Dower et al. International 
Publication No. WO 91/17271; Winter et al. International Publication WO 92/20791; 
Markland et al. International Publication No. WO 92/15679; Breitling et al. 
International Publication WO 93/01288; McCafferty et al. International Publication 
No. WO 92/01047; Garrard et al. International Publication No. WO 92/09690; 
Ladner et al. International Publication No. WO 90/02809; Fuchs et al., 1991, 
BioTechnology 9:1370-1372; Hay et al., 1992, Hum. Antibod. Hybridomas 3:81-85; 
Huse et al., 1989, Science 246:1275-1281; Griffiths et al., 1993, EMBO J 12:725- 
734; Hawkins et al., 1992, J. Mol. Biol. 226:889-896; Clarkson et al., 1991, Nature 
352:624-628; Gram et al., 1992, PNAS 89:3576-3580; Garrad et al., 1991, 
BioTechnology 9:1373-1377; Hoogenboom et al., 1991, Nuc. Acid Res. 19:4133- 
4137; Barbas et al., 1991, PNAS 88:7978-7982; and McCafferty et al., 1990, Nature 
348:552-55. 

[00123] Additionally, recombinant antibodies to a target or target-binder, 
peptide, or polypeptide-complex, such as chimeric and humanized monoclonal 
antibodies, comprising both human and non-human portions, can be made using 
standard recombinant DNA techniques. Such chimeric and humanized monoclonal 
antibodies can be produced by recombinant DNA techniques known in the art, for 
example using methods described in Robinson et al. International Application No. 
PCT/US86/02269; Akira, et al. European Patent Application 184,187; Taniguchi, M., 
European Patent Application 171,496; Morrison et al. European Patent Application 
173,494; Neuberger et al. PCT International Publication No. WO 86/01533; Cabilly 
et al. U.S. Pat. No. 4,816,567; Cabilly et al. European Patent Application 125,023; 
Better et al., 1988, Science 240:1041-1043; Liu et al., 1987, PNAS 84:3439-3443; 
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Liu et al., 1987, J. Immunol. 139:3521-3526; Sun et al., 1987, PNAS 84:214-218; 
Nishimura et al., 1987, Cane. Res. 47:999-1005; Wood et al., 1985, Nature 314:446- 
449; and Shaw et al., 1988, J. Natl. Cancer Inst. 80:1553-1559; S.L. Morrison, 1985, 
Science 229:1202-1207; Oi et al., 1986, BioTechniques 4:214; Winter U.S. Pat. No. 
5,225,539; Jones et al., 1986, Nature 321:552-525; Verhoeyan et al., 1988, Science 
239:1534; andBcidler et al., 1988, J. Immunol. 141:4053-4060. 

[00124] An antibody against a target or target-binder, peptide, or polypeptide- 
complex (e.g., monoclonal antibody) can be used to isolate the corresponding 
molecules by standard techniques, such as affinity chromatography or 
immunoprecipitation. For example, antibodies can facilitate the purification of a 
natural polypeptides or polypeptide-complexes from cells and of a recombinantly 
produced polypeptides or complexes produced in host cells. In addition, an antibody 
that binds to a target, target-binder, or target-partner, peptide, or complex can be used 
to detect the corresponding molecule (e.g., in a cellular lysate or cell supernatant) in 
order to evaluate patterns or levels of protein expression or complexation. Such 
antibodies can also be used diagnostically to monitor polypeptide or polypeptide- 
complex levels in tissue as part of a clinical testing procedure, e.g., to, for example, 
determine the efficacy of a given treatment regimen as described in detail herein. In 
addition, antibodies to a target, target-binder, or target-partner, peptide, or 
polypeptide-complex can be used as therapeutics for the treatment of diseases related 
to abnormal gene expression or function, e.g., as relating to the development of 
neoplasms and cancers. 

[00125] Alternatively, libraries of natural compounds in the form of bacterial, 
fungal, plant and animal extracts can be readily produced. Methods for the synthesis 
of molecular libraries are readily available (see, e.g., DeWitt et al., 1993, Proc. Natl. 
Acad. Sci. USA 90:6909; Erb et al., 1994, Proc. Natl. Acad. Sci. USA 91:1 1422; 
Zuckermann et al., 1994, J. Med. Chem. 37:2678; Cho et al., 1993, Science 
261:1303; Carell et al., 1994, Angew. Chem. Int. Ed. Engl. 33:2059; Carell et al., 
1994, Angew. Chem. Int. Ed. Engl. 33:2061; and in Gallop et al., 1994, J. Med. 
Chem. 37:1233). In addition, natural or synthetic compound libraries and 
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compounds can be readily modified through conventional chemical, physical and 
biochemical means (see, e.g., Blondelle et aL, 1996, Trends in Biotech. 14:60), and 
may be used to produce combinatorial libraries. In another approach, previously 
identified pharmacological agents can be subjected to directed or random chemical 
modifications, such as acylation, alkylation, esterification, amidification, and the 
analogs can be screened for targets and/or target-partner-modulating activity. 

[00126] Numerous methods for producing combinatorial libraries are known in 
the art, including those involving biological libraries; spatially addressable parallel 
solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the 'one-bead one-compound' library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to polypeptide libraries, while the other four approaches are applicable to 
polypeptide, non-peptide oligomer, or small molecule libraries of compounds (K. S. 
Lam, 1997, Anticancer Drug Des. 12:145). 

[00127] Non-limiting examples of small molecules, small molecule libraries, 
combinatorial libraries, and screening methods are described in B. Seligmann, 1995, 
"Synthesis, Screening, Identification of Positive Compounds and Optimization of 
Leads from Combinatorial Libraries: Validation of Success" p. 69-70. Symposium: 
Exploiting Molecular Diversity: Small Molecule Libraries for Drug Discovery, La 
Jolla, CA, Jan. 23-25, 1995 (conference summary available from Wendy Warr & 
Associates, 6 Berwick Court, Cheshire, UK CW4 7HZ); E. Martin et aL, 1995, J. 
Med. Chem. 38:1431-1436; E. Martin et al., 1995, "Measuring diversity: 
Experimental design of combinatorial libraries for drug discovery" Abstract, ACS 
Meeting, Anaheim, CA, COMP 32; and E. Martin, 1995, "Measuring Chemical 
Diversity: Random Screening or Rationale Library Design" p. 27-30, Symposium: 
Exploiting Molecular Diversity: Small Molecule Libraries for Drug Discovery, La 
Jolla, Calif. Jan. 23-25, 1995 (conference summary available from Wendy Warr & 
Associates, 6 Berwick Court, Cheshire, UK CW4 7HZ). 
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[00128] Libraries may be screened in solution (e.g., Houghten, 1 992, 
Biotechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips 
(Fodor, 1993, Nature 364:555-556), bacteria or spores (Ladner U.S. Pat. No. 
5,223,409), plasmids (Cull et aL, 1992, Proc. Natl. Acad. Sci. USA 89:1865-1869), 
or on phage (Scott and Smith, 1990, Science 249:386-390; Devlin, 1990, Science 
249:404-406; Cwirla et al., 1990, Proc. Natl. Acad. Sci. USA 97:6378-6382; Felici, 
1991, J. Mol. Biol. 222:301-310; Ladner, supra). 

[00129] Where the screening assay is a binding assay, a target or target-binder 
polypeptide, polynucleotide, variant, polypeptide-complex, or fragment thereof, may 
be joined to a label, where the label can directly or indirectly provide a detectable 
signal. Various labels include radioisotopes, fluorophores, chemiluminescent 
compounds, enzymes, specific binding molecules, particles, e.g., magnetic particles, 
and the like. Specific binding molecules include pairs, such as biotin and 
streptavidin, digoxin and antidigoxin, etc. For the specific binding members, the 
complementary member would normally be labeled with a molecule that provides for 
detection, in accordance with known procedures. 

[00130] A variety of other reagents may be included in the screening assay. 
These include reagents like salts, neutral proteins, e.g., albumin, detergents, etc., 
which are used to facilitate optimal protein-protein binding and/or reduce non- 
specific or background interactions. Reagents that improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., 
may be used. The components are added in any order that produces the requisite 
binding. Incubations are performed at any temperature that facilitates optimal 
activity, typically between 4° and 40°C. Incubation periods are selected for optimum 
activity, but may also be optimized to facilitate rapid high-throughput screening. 
Normally, between 0.1 and 1 hr will be sufficient. In general, a plurality of assay 
mixtures is run in parallel with different agent concentrations to obtain a differential 
response to these concentrations. Typically, one of these concentrations serves as a 
negative control, i.e., at zero concentration or below the level of detection. 
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[00131] To perform cell-free ligand screening assays, it may be desirable to 
immobilize either a target or target-binder polypeptide, polynucleotide, fragment, or 
polypeptide-complex, to a surface to facilitate identification of ligands that bind to 
these molecules, as well as to accommodate automation of the assay. For example, a 
fusion protein comprising a DGI-2 polypeptide and an affinity tag can be produced. 
In one embodiment, a glutathione-S-transferase/phosphodiesterase fusion protein 
comprising a DGI-2 polypeptide is adsorbed onto glutathione sepharose beads 
(Sigma Chemical, St. Louis, MO) or glutathione-derivatized microtiter plates. Cell 
lysates (e.g., containing 35S-labeled polypeptides) are added to the coated beads 
under conditions to allow complex formation (e.g., at physiological conditions for 
salt and pH). Following incubation, the coated beads are washed to remove any 
unbound polypeptides, and the amount of immobilized radiolabel is determined. 
Alternatively, the complex is dissociated and the radiolabel present in the supernatant 
is determined. In another approach, the beads are analyzed by SDS-PAGE to identify 
the bound polypeptides. 

[00132] Target binders or ligand binding assays can be used to identify agonist 
or antagonists that alter the function or levels of a target polypeptide. Such assays 
are designed to detect the interaction of test agents (e.g., small molecules) with target 
or polypeptides, polynucleotides, polypeptide-complexes, or fragments or portions 
thereof. Interactions may be detected by direct measurement of binding. 
Alternatively, interactions may be detected by indirect indicators of binding, such as 
stabilization/destabilization of protein structure, or activation/inhibition of biological 
function. Non-limiting examples of useful ligand-binding assays are detailed below. 

[00133] Ligands that bind to targets, for example, DGI-2 polypeptides, 
polynucleotides, polypeptide-complexes, or fragments or portions thereof, can be 
identified using real-time Bimolecular Interaction Analysis (BIA; Sjolander et al., 
1991, Anal. Chem. 63:2338-2345; Szabo et al., 1995, Curr. Opin. Struct. Biol. 5:699- 
705). BIA-based technology (e.g., BIAcoreTM; LKB Pharmacia, Sweden) allows 
study of biospecific interactions in real time, without labeling. In BIA, changes in 
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the optical phenomenon surface plasmon resonance (SPR) is used determine real- 
time interactions of biological molecules. 

[00134] Ligands can also be identified by scintillation proximity assays (SPA, 
described in U.S. Patent No. 4,568,649). In a modification of this assay that is 
currently undergoing development, chaperonins are used to distinguish folded and 
unfolded proteins. A tagged protein is attached to SPA beads, and test agents are 
added. The bead is then subjected to mild denaturing conditions (such as, e.g., heat, 
exposure to SDS, etc.) and a purified labeled chaperonin is added. If a test agent 
binds to a target, the labeled chaperonin will not bind; conversely, if no test agent 
binds, the protein will undergo some degree of denaturation and the chaperonin will 
bind. 

[00135] Ligands can also be identified using a binding assay based on 
mitochondrial targeting signals (Hurt et al., 1985, EMBO J. 4:2061-2068; Eilers and 
Schatz, 1986, Nature 322:228-231). In a mitochondrial import assay, expression 
vectors are constructed in which nucleic acids encoding particular target proteins are 
inserted downstream of sequences encoding mitochondrial import signals. The 
chimeric proteins are synthesized and tested for their ability to be imported into 
isolated mitochondria in the absence and presence of test compounds. A test 
compound that binds to the target protein should inhibit its uptake into isolated 
mitochondria in vitro. 

[00136] The ligand-binding assay described in Fodor et al., 1 99 1 , Science 
251 :767-773, which involves testing the binding affinity of test compounds for a 
plurality of defined polymers synthesized on a solid substrate, can also be used. 

[00137] Several methods of automated assays have been developed in recent 
years so as to permit screening of tens of thousands of test agents in a short period of 
time. High-throughput screening methods are particularly preferred for use with the 
present invention. The assays described herein can be adapted for high-throughput 
screens, or alternative screens may be employed. For example, continuous format 
high throughput screens (CF-HTS) using at least one porous matrix allows the 
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researcher to test large numbers of test agents for a wide range of biological or 
biochemical activity (see United States Patent No. 5,976,813 to Beutel et al.)- 
Moreover, CF-HTS can be used to perform multi-step assays. 

[00138] Other useful screening assays are based on those disclosed in 
International application WO 96/04557, which is incorporated herein in its entirety. 
Briefly, WO 96/04557 discloses the use of reporter peptides that bind to active sites 
on targets and possess agonist or antagonist activity at the target. These reporters are 
identified from recombinant libraries and are either peptides with random amino acid 
sequences or variable antibody regions with at least one CDR region that has been 
randomized (rVab). The reporter peptides may be expressed in cell recombinant 
expression systems, such as for example in E. coli, or by phage display (see WO 
96/04557 and Kay et al. 1996, Mol. Divers. 1(2): 139-40, both of which are 
incorporated herein by reference). The reporters identified from the libraries may 
then be used in accordance with this invention either as therapeutics themselves, or in 
competition binding assays to screen for other molecules, preferably small, active 
molecules, which possess similar properties to the reporters and may be developed as 
drug candidates to provide agonist or antagonist activity. Preferably, these small 
organic molecules are orally active. 

[00139] Additionally, an in vitro competitive receptor binding assay can be 
used as the basis of a heterogeneous screen for small organic molecular replacements 
for target-binders. Occupation of the active site of a target, e.g. DGI-2 can be 
quantified by time-resolved fluorometric detection (TRFD) with streptavidin-labeled 
europium (saEu) complexed to biotinylated peptides (bP). In this assay, saEu can 
form a ternary complex with bP and DGI-2 (i.e., DGI-2:bP:saEu complex). The 
TRFD assay format is well-established, sensitive, and quantitative (Tompkins et al., 
1993, J. Immunol. Methods 163:209-216). The assay can use a single-chain antibody 
or a biotinylated peptide. 

[00140] In such assays, soluble DGI-2, for example, is coated on the surface of 
microtiter wells, blocked by a solution of 0.5% bovine serum albumin (BSA) and 2% 
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non-fat milk in PBS, and then incubated with biotinylated peptide or rVab. Unbound 
bP is then washed away and saEu is added to complex with receptor-bound bP. 
Upon addition of the acidic enhancement solution, the bound europium is released as 
free Eu 3+ which rapidly forms a highly fluorescent and stable complex with 
components of the enhancement solution. The DGI-2:bP bound saEu is then 
converted into its highly fluorescent state and detected by a detector such as Wallac 
Victor n (EG&G Wallac, Inc.) 

Diagnostics 

[00141] As discussed herein below, targets, such as DGI-2, may be associated 
with various cellular processes, including cell growth, proliferation, and attachment. 
Defects in these processes can lead to the development of neoplasms, including 
metastatic cancers. Non-limiting examples of cancers include bone cancer, brain 
cancer, breast cancer, endocrine system cancers, gastrointestinal cancers (e.g., 
colorectal and pancreatic cancers) male genituorinary cancers (e.g., prostate cancer), 
germ cell cancers, gynecologic cancers (e.g., ovarian, cervical, endometrial, and 
vulvar cancers), head and neck cancers, leukemia, lung cancer, lymphomas (e.g., 
Hodgkin f s and non-Hodgkin's lymphomas), skin cancers (e.g., myelomas and 
sarcomas), and urinary cancers (e.g., bladder and kidney cancers). Metastatic cancers 
include, but are not limited to, cancers affecting bone, breast, lung, brain, spinal cord, 
skin, ovaries, bladder, and gastrointestinal tissues. Other possible indications may 
include diseases of the central nervous system and the cardiovascular system. 

[00142] Hras (Harvey Rat sarcoma viral oncogene homolog) may be involved 
in the regulation of growth and/or differentiation of neural cells during development 
in autistic children, for example. Hras may also be involved in non-melanoma skin 
cancer and tumor formation. Leptin is involved in activating the sympathetic nervous 
system, in regulation of blood pressure, hematopoiesis, immune function, 
angiogenesis and brain, bone and pituitary development. Vascular endothelial 
growth factor receptor (VEGFR) may play an important role in vascular diseases, 
especially in cellular signaling: growth, differentiation, cytokine and vascular 
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regulation. Tie plays a similar role to VEGFR and may be involved in the later 
stages of vessel growth and remodeling. Diseases involving VEGFR include, but are 
not limited to pre-eclampsia or intra-uterine growth retardation, leukemias, and 
gliomas. Epidermal growth factor receptor (EGFR) is involved in a wide variety of 
cancers such as breast, squamous cell carcinoma, prostate and colorectal carcinoma, 
and psoriasis. Fibroblast growth factor receptor (FGFR) is involved in growth and 
differentiation signaling, where colon and prostate cancers are just two examples. 
The present invention therefore provides compositions (e.g., diagnostic reagents) 
comprising DGI-2 or DGI-2-binder polynucleotides, polypeptides or peptides (e.g., 
SEQ ID NOs: 1-189) as well as for Hras, Leptin, VEGFR, FGFR, EGFR, to name a 
few, antibodies, and fragments thereof that can be useful in diagnosing and 
monitoring the treatment of these conditions. One or more of these peptides are also 
suitable components for diagnostic kits. 

Antibody-Based Diagnostics 

[00143] In one embodiment of the present invention, antibodies which 
specifically bind to a target or target-binder polypeptide or peptide complex may be 
used for the diagnosis of conditions or diseases characterized by altered levels of 
target or target-binder polypeptides or peptide complexes. Alternatively, such 
antibodies may be used in assays to monitor patients being treated with a target or 
target-binder polypeptide, polynucleotide, antibody, or modulator. 

[00144] The antibodies useful for diagnostic purposes may be prepared in the 
same manner as those for use in therapeutic methods, described herein. Antibodies 
may be raised to a full-length target polypeptide. Alternatively, the antibodies may 
be raised to portions or variants of these polypeptides. In one aspect of the invention, 
antibodies are prepared to bind to a target polypeptide fragment, or target binder, 
such as DGI-2 or DGI-2 binder, comprising one or more domains of the polypeptide 
(e.g., PH, GEF, or kinase domains), as described in detail herein. 
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[00145] Diagnostic assays for a target polypeptide include methods that utilize 
the antibody and a label to detect the protein in biological samples (e.g., human body 
fluids, cells, tissues, or extracts of cells or tissues). The antibodies may be used with 
or without modification, and may be labeled by joining them, either covalently or 
non-covalently, with a reporter molecule. A wide variety of reporter molecules that 
are known in the art may be used, several of which are described herein. 

[00146] Many immunoassay formats are known in the art, and the particular 
format used is determined by the desired application. An immunoassay can use, for 
example, a monoclonal antibody directed against a single disease-associated epitope, 
a combination of monoclonal antibodies directed against different epitopes of a single 
disease-associated antigenic component, monoclonal antibodies directed towards 
epitopes of different disease-associated antigens, polyclonal antibodies directed 
towards the same disease-associated antigen, or polyclonal antibodies directed 
towards different disease-associated antigens. Protocols can also, for example, use 
solid supports, or may involve immunoprecipitation. Typically, immunoassays use 
either a labeled antibody or a labeled antigenic component (i.e., to compete with the 
antigen in the sample for binding to the antibody). Exemplary labels are described in 
the sections shown above. 

[00147] In accordance with the present invention, "competitive" (U.S. Pat. 
Nos. 3,654,090 and 3,850,752), "sandwich" (U.S. Pat. No. 4,016,043), and "double 
antibody," or "DASP" assays may be used. Several procedures for measuring the 
amount of target, target-binder or a target-partner polypeptide in a sample (e.g., 
ELISA, RIA, and FACS) are known in the art and provide a basis for diagnosing 
altered or abnormal levels of polypeptides or polypeptide complexes. Normal or 
standard values for a polypeptide or polypeptide complex are established by 
incubating biological samples taken from normal subjects, preferably human, with 
antibody to a polypeptide or polypeptide complex under conditions suitable for 
association. The amount of standard antibody-antigen association may be quantified 
by various methods; photometric means are preferred. Levels of the polypeptide or 
polypeptide complex in the subject sample, negative control (normal) sample, and 
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positive control (disease) sample are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing 
disease. 

[00148] Kits suitable for antibody-based diagnostic applications typically 
include one or more of the following components: 

[001491 (1) Antibodies: The antibodies may be pre-labeled. Alternatively, 
the antibody may be unlabeled and the ingredients for labeling may be included in 
the kit in separate containers, or a secondary, labeled antibody is provided. 
Antibodies may be monoclonal or polyclonal, and may be directed to target or target- 
binders, or peptide complexes thereof; and 

[00150] (2) Reaction components: The kit may also contain other suitably 
packaged reagents and materials needed for the particular immunoassay protocol, 
including solid-phase matrices or beads, if applicable, and standards. 

[00151] The kits referred to above may include instructions for conducting the 
test. Furthermore, in preferred embodiments, the diagnostic kits are adaptable to 
high-throughput and/or automated operation. 

[00152] A sample to be analyzed, such as, for example, a tissue sample (e.g., 
hair or buccal cavity) or body fluid sample (e.g., blood, saliva, or urine), may be 
contacted directly with the nucleic acid probes. Alternatively, the sample may be 
treated to extract the nucleic acids contained therein. It will be understood that the 
particular method used to extract DNA will depend on the nature of the biological 
sample. The resulting nucleic acid from the sample may be subjected to gel 
electrophoresis or other size separation techniques, or, the nucleic acid sample may 
be immobilized on an appropriate solid matrix without size separation. 

Therapeutics 

[00153] As indicated herein below, target and target-binders may be associated 
with various cell processes, including cell growth, proliferation, and adhesion. 
Uncontrolled activation of cell growth and proliferation can lead to oncogenesis, 
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whereas inhibition of cell attachment can lead to neoplastic growth and metastasis. 
Non-limiting examples of cancers include bone cancer, brain cancer, breast cancer, 
endocrine system cancers, gastrointestinal cancers (e.g., colorectal and pancreatic 
cancers) male genituorinary cancers (e.g., prostate cancer), germ cell cancers, 
gynecologic cancers (e.g., ovarian, cervical, endometrial, and vulvar cancers), head 
and neck cancers, leukemia, lung cancer, lymphomas (e.g., Hodgkin's and non- 
Hodgkin's lymphomas), skin cancers (e.g., myelomas and sarcomas), and urinary 
cancers (e.g., bladder and kidney cancers). Metastatic cancers include, but are not 
limited to, cancers affecting bone, breast, lung, brain, spinal cord, skin, ovaries, 
bladder, and gastrointestinal tissues. Other possible indications may include, but are 
not limited to disease of the CNS and the cardiovascular system. 

Drug screening and design 

[001 54] The present invention provides methods of screening for drugs using 
target or target-binder polypeptides, peptides, or polypeptide-complexes thereof, in 
competitive binding assays, according to methods well-known in the art. For 
example, competitive drug screening assays can be employed using a complex 
comprising target and a target-binder, and screening for a test compound that 
disrupts, enhances, or otherwise alters the polypeptide-complex. 

[00155] The present invention further provides methods of rational drug design 
employing a target or target-binder polynucleotide, polypeptide, antibody, 
polypeptide-complex, or portion or functional equivalent thereof. The goal of 
rational drug design is to produce structural analogs of biologically active 
polypeptides of interest or of small molecules with which they interact (e.g., agonists, 
antagonists, or inhibitors). In turn, these analogs can be used to fashion drugs which 
are, for example, more active or stable forms of the polypeptide, or which, e.g., 
enhance or interfere with the function of the polypeptide in vivo (see, e.g., Hodgson, 
1991, BioTechnology, 9:19-21). An example of rational drug design is the 
development of HIV protease inhibitors (Erickson et al., 1990, Science, 249:527- 
533). 
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[00156] Non-limiting examples of methods and computer tools for drug design 
are described in R. Cramer et al., 1974, J. Med. Chem. 17:533; H. Kubinyi (ed) 1993, 
3D QSAR in Drug Design, Theory, Methods, and Applications, ESCOM, Leiden, 
Holland; P. Dean (ed) 1995, Molecular Similarity in Drug Design, K. Kim 
"Comparative molecular field analysis (ComFA)" p. 291-324, Chapman & Hill, 
London, UK; Y. et al., 1993, J. Comp.-Aid. MoL Des. 7:83-102; G. Lauri and P.A. 
Bartlett, 1994, J. Comp.-Aid. Mol. Des. 8:51-66; P.J. Gane and P.M. Dean, 2000, 
Curr. Opin. Struct. Biol. 10(4):401-4; H.O. Kim and M. Kahn, 2000, Comb. Chem. 
High Throughput Screen. 3(3):167-83; G.K. Farber, 1999, Pharmacol Ther. 
84(3):327-32; and H. van de Waterbeemd (Ed.) 1996, Structure-Property Correlations 
in Drug Research, Academic Press, San Diego, CA. 

[00157] In another aspect of the present invention, cells and animals that carry 
a human target gene or an variant thereof can be used as model systems to study and 
test for substances that have potential as therapeutic agents. After a test agent is 
administered to animals or applied to the cells, the phenotype of the animals/cells can 
be determined. 

[00158] In accordance with these methods, one may design drugs that result in, 
for example, altered target or target-binder activity or stability. Such drugs may act 
as inhibitors, agonists, or antagonists of these polypeptides, or the complexes formed 
by these peptides. By virtue of the availability of target and target-binder nucleotide 
sequences, sufficient amounts of these polypeptides may be produced to perform 
such analytical studies as x-ray crystallography. In addition, the knowledge of the 
target and target-binder polypeptide sequences will guide those employing computer- 
modeling techniques in place of, or in addition to x-ray crystallography. 

Pharmaceutical Compositions 

[00159] The present invention contemplates compositions comprising a target 
or target-binder polynucleotide, polypeptide, antibody, ligand (e.g., agonist, 
antagonist, or inhibitor), polypeptide-complex, or fragments or variants thereof, and a 
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physiologically acceptable carrier, excipient, or diluent. The present invention 
further contemplates pharmaceutical compositions useful in practicing the therapeutic 
methods of this invention. Preferably, a pharmaceutical composition includes, in 
admixture, a pharmaceutically acceptable excipient (carrier) and one or more of a 
target or target-binder polynucleotide, polypeptide, ligand, antibody, polypeptide- 
complex, or fragment, portion, or variant thereof, as described herein, as an active 
ingredient. The preparation of pharmaceutical compositions that contain target or 
target-binder molecules as active ingredients is well understood in the art. Typically, 
such compositions are prepared as injectables, either as liquid solutions or 
suspensions, however, solid forms suitable for solution in, or suspension in, liquid 
prior to injection can also be prepared. The preparation can also be emulsified. The 
active therapeutic ingredient is often mixed with excipients that are pharmaceutically 
acceptable and compatible with the active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like and combinations 
thereof. In addition, if desired, the composition can contain minor amounts of 
auxiliary substances such as wetting or emulsifying agents, pH-buffering agents, 
which enhance the effectiveness of the active ingredient. 

[00160] A target or target-binder polynucleotide, polypeptide, ligand, 
antibody, polypeptide-complex, or fragment, portion, or variant thereof can be 
formulated into the pharmaceutical composition as neutralized physiologically 
acceptable salt forms. Suitable salts include the acid addition salts (i.e., formed with 
the free amino groups of the polypeptide or antibody molecule) and which are formed 
with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such 
organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed from the 
free carboxyl groups can also be derived from inorganic bases such as, for example, 
sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases 
as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the 
like. 

[00161] The pharmaceutical compositions can be administered systemically by 
oral or parenteral routes. Non-limiting parenteral routes of administration include 
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subcutaneous, intramuscular, intraperitoneal, intravenous, transdermal, inhalation, 
intranasal, intra-arterial, intrathecal, enteral, sublingual, or rectal. In one particular 
embodiment of the present invention, the disclosed pharmaceutical compositions are 
administered via mucoactive aerosol therapy (see, e.g., M. Fuloria and B.K. Rubin, 
2000, Respir. Care 45:868-873; I. Gonda, 2000, J. Pharm. Sci. 89:940-945; R. 
Dhand, 2000, Curr. Opin. Pulm. Med. 6(l):59-70; B.K. Rubin, 2000, Respir. Care 
45(6):684-94; S. Suarez and AJ. Hickey, 2000, Respir. Care. 45(6):652-66). 
Intravenous administration, for example, can be performed by injection of a unit 
dose. The term "unit dose" when used in reference to a pharmaceutical composition 
of the present invention refers to physically discrete units suitable as unitary dosage 
for humans, each unit containing a predetermined quantity of active material 
calculated to produce the desired therapeutic effect in association with the required 
diluent; i.e., carrier, or vehicle. 

[00162] Pharmaceutical compositions are administered in a manner compatible 
with the dosage formulation, and in a therapeutically effective amount. The quantity 
to be administered depends on the subject to be treated, capacity of the subject's 
immune system to utilize the active ingredient, and degree of modulation of target, 
target-binder or target-partner activity desired. Precise amounts of active ingredient 
required to be administered depend on the judgment of the practitioner and are 
specific for each individual. However, suitable dosages may range from about 0.1 to 
20, preferably about 0.5 to about 10, and more preferably one to several, milligrams 
of active ingredient per kilogram body weight of individual per day and depend on 
the route of administration. Suitable regimes for initial administration and booster 
shots are also variable, but are typified by an initial administration followed by 
repeated doses at one or more hour intervals by a subsequent injection or other 
administration. Alternatively, continuous intravenous infusions sufficient to maintain 
concentrations of 10 nM to 10 in the blood are contemplated. An exemplary 
pharmaceutical formulation comprises: DGI-2 antagonist or inhibitor (5.0 mg/ml); 
sodium bisulfite USP (3.2 mg/ml); disodium edetate USP (0.1 mg/ml); and water for 
injection q.s.a.d. (1.0 ml). As used herein, "pg" means picogram, "ng" means 
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nanogram, "jxg" means microgram, "mg" means milligram, "pi" means microliter, 
"ml" means milliliter, and "1" means L. 

[00163] For further guidance in preparing pharmaceutical formulations, see, 
e.g., Gilman et al. (eds), 1990, Goodman and Oilman's: The Pharmacological Basis 
of Therapeutics, 8th ed., Pergamon Press; and Remington's Pharmaceutical Sciences, 
17th ed., 1990, Mack Publishing Co., Easton, PA; Avis et al. (eds), 1993, 
Pharmaceutical Dosage Forms: Parenteral Medications, Dekker, New York; 
Lieberman et al. (eds), 1990, Pharmaceutical Dosage Forms: Disperse Systems, 
Dekker, New York. 

[00164] In yet another aspect of this invention, antibodies that specifically 
react with a target or target-binder polypeptide, peptides, or polypeptide-complexes 
comprised thereof can be used as therapeutics. In particular, such antibodies can be 
used to block the activity of a target or target-binder polypeptide or target/target- 
binder complex. Antibodies or fragments thereof can be formulated as 
pharmaceutical compositions and administered to a subject. It is noted that antibody- 
based therapeutics produced from non-human sources can cause an undesired 
immune response in human subjects. To minimize this problem, chimeric antibody 
derivatives can be produced. Chimeric antibodies combine a non-human animal 
variable region with a human constant region. Chimeric antibodies can be 
constructed according to methods known in the art (see Morrison et al., 1985, Proc. 
Natl. Acad. ScL USA 81:6851; Takeda et al., 1985, Nature 314:452; U.S. Patent No. 
4,816,567 of Cabilly et al.; U.S. Patent No. 4,816,397 of Boss et al.; European Patent 
Publication EP 171496; EP 0173494; United Kingdom Patent GB 2177096B). 

[00165] In addition, antibodies can be further "humanized" by any of the 
techniques known in the art, (e.g., Teng et al., 1983, Proc. Natl. Acad. Sci. USA 
80:7308-7312; Kozbor et al., 1983, Immunology Today 4: 7279; Olsson et al., 1982, 
Meth. Enzymol. 92:3-16; International Patent Application WO92/06193; EP 
0239400). Humanized antibodies can also be obtained from commercial sources 
(e.g., Scotgen Limited, Middlesex, England). Immunotherapy with a humanized 
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antibody may result in increased long-term effectiveness for the treatment of chronic 
disease situations or situations requiring repeated antibody treatments.Suitable gene 
transfer vectors possess a promoter sequence, preferably a promoter that is cell- 
specific and placed upstream of the sequence to be expressed. The vectors may also 
contain, optionally, one or more expressible marker genes for expression as an 
indication of successful transfection and expression of the nucleic acid sequences 
contained in the vector. In addition, vectors can be optimized to minimize undesired 
immunogenicity and maximize long-term expression of the desired gene product(s) 
(see Nabe, 1999, Proc. Natl Acad. Sci. USA 96:324-326). Moreover, vectors can be 
chosen based on cell-type that is targeted for treatment. Notably, gene transfer 
therapies have been initiated for the treatment of various pulmonary diseases (see, 
e.g., M.J. Welsh, 1999, J. Clin. Invest. 1 04(9): 1 165-6; D.L. Ennist, 1999, Trends 
Pharmacol. Sci. 20:260-266; S.M. Albelda et al., 2000, Ann. Intern. Med. 132:649- 
660; E. Alton and C. Kitson C, 2000, Expert Opin. Investig. Drugs. 9(7): 1523-35). 

[00166] Illustrative examples of vehicles or vector constructs for transfection 
or infection of the host cells include replication-defective viral vectors, DNA virus or 
RNA virus (retrovirus) vectors, such as adenovirus, herpes simplex virus and adeno- 
associated viral vectors. Adeno-associated virus vectors are single stranded and 
allow the efficient delivery of multiple copies of nucleic acid to the cell's nucleus. 
Preferred are adenovirus vectors. The vectors will normally be substantially free of 
any prokaryotic DNA and may comprise a number of different functional nucleic 
acid sequences. An example of such functional sequences may be a DNA region 
comprising transcriptional and translational initiation and termination regulatory 
sequences, including promoters (e.g., strong promoters, inducible promoters, and the 
like) and enhancers which are active in the host cells. Also included as part of the 
functional sequences is an open reading frame (polynucleotide sequence) encoding a 
protein of interest. Flanking sequences may also be included for site-directed 
integration. In some situations, the 5 '-flanking sequence will allow homologous 
recombination, thus changing the nature of the transcriptional initiation region, so as 
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to provide for inducible or non-inducible transcription to increase or decrease the 
level of transcription, as an example. 

[00167] In general, the encoded and expressed target or target-binder 
polypeptide may be intracellular, i.e., retained in the cytoplasm, nucleus, or in an 
organelle, or may be secreted by the cell. For secretion, the natural signal sequence 
present in the polypeptide may be retained. When the polypeptide or peptide is a 
fragment of a full-length protein, a signal sequence may be provided so that, upon 
secretion and processing at the processing site, the desired protein will have the 
natural sequence. Specific examples of coding sequences of interest for use in 
accordance with the present invention include the target or target-binder polypeptide- 
coding sequences shown in the GenBank and GenPept entries described herein. 

[00168] As previously mentioned, a marker may be present for selection of 
cells containing a vector construct. The marker may be an inducible or non-inducible 
gene and will generally allow for positive selection under induction, or without 
induction, respectively. Examples of marker genes include neomycin, dihydro folate 
reductase, glutamine synthetase, and the like. The vector employed will generally 
also include an origin of replication and other genes that are necessary for replication 
in the host cells, as routinely employed by those having skill in the art. As an 
example, the replication system comprising the origin of replication and any proteins 
associated with replication encoded by a particular virus may be included as part of 
the construct. The replication system must be selected so that the genes encoding 
products necessary for replication do not ultimately transform the cells. Such 
replication systems are represented by replication-defective adenovirus (see G. 
Acsadi et al., 1994, Hum. Mol Genet, 3:579-584) and by Epstein-Barr virus. 
Examples of replication defective vectors, particularly, retroviral vectors that are 
replication defective, are BAG, (see Price et al., 1987, Proc. Natl Acad. Sci. USA, 
84:156; Sanes et al., 1986, EMBOJ., 5:3133). It will be understood that the final 
gene construct may contain one or more genes of interest, for example, a gene 
encoding a bioactive metabolic molecule. In addition, cDNA, synthetically produced 
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DNA or chromosomal DNA may be employed utilizing methods and protocols 
known and practiced by those having skill in the art. 

[00169] According to one approach for gene therapy, a vector encoding a 
target polypeptide is directly injected into the recipient cells (in vivo gene therapy). 
Alternatively, cells from the intended recipients are explanted, genetically modified 
to encode a target polypeptide, and reimplanted into the donor (ex vivo gene therapy). 
An ex vivo approach provides the advantage of efficient viral gene transfer, which is 
superior to in vivo gene transfer approaches. In accordance with ex vivo gene 
therapy, the host cells are first transfected with engineered vectors containing at least 
one gene encoding a target or target-binder polypeptide, suspended in a 
physiologically acceptable carrier or excipient such as saline or phosphate buffered 
saline, and the like, and then administered to the host. The desired gene product is 
expressed by the injected cells, which thus introduce the gene product into the host. 
The introduced gene products can thereby be utilized to treat or ameliorate a disorder 
(e.g., neoplastic growth or cancer) that is related to altered levels, activities, and/or 
interactions of the target, target-binder or target-partner polypeptides. 

Sequence Analysis 

[00170] Publicly available sequence databases, e.g., GenBank, GenPept, 
SWISS-PROT, Protein Data Bank (PDB), Protein Information Resource (PIR), 
Human UniGene (National Center for Biotechnology Information), can be used to 
determine sequences that share homology with target, target-binder, or target-partner 
nucleotide or amino acid sequences, or fragments, or sequences obtained for other 
targets, thereof. Alternatively, privately owned sequence databases, e.g., the Incyte 
Genomics sequence database (Incyte Genomics), can be used. Databases with 
relatively few redundant sequences, e.g., PIR or SWISS-PROT databases, may be 
used to improve the statistical significance of a sequence match. However, databases 
which are more comprehensive and up-to-date, e.g., GenBank, GenPept, and Incyte 
Genomics sequence databases (Incyte Genomics, Inc., St. Louis, MO), are preferred. 



WO 03/035839 



-58- 



PCT/US02/34021 



[00171] Any method known in the art can be used to align and compare a 
DGI-2, DGI-2-binder, or DGI-2-partner sequence with the sequences present in a 
sequence database. Preferably, the BLAST (Basic Local Alignment Search Tool) 
program is used (S.F. Altschul et al., 1990, J. MoL Biol 215:403-410; S. Karlin et al., 
1990, Proc. Natl Acad. Sci. USA 87:2264-68; S. Karlin et al., 1993, Proc. Natl 
Acad. Sci. USA 90:5873-7). BLAST identifies local alignments between the 
sequence of the previously identified protein and the protein sequences in the 
database, and predicts the probability of the local alignment occurring by chance. 
Although the original BLAST programs utilized ungapped local alignments, more 
recently developed BLAST programs such as WU-BLAST2/BLAST v2.0 (S. F. 
Altschul et al., 1996, Methods Enzymol. 266:460-480) have been modified to 
incorporate gapped local alignments similar to SSEARCH (T.F. Smith et al., 1981, J. 
MoL Biol 147:195-197) and FASTA programs (W.R. Pearson, 1990, Methods 
Enzymol 183:63-98). In addition, position-specific-iterated BLAST (PSI-BLAST) 
programs have been developed to identify weak but biologically relevant sequence 
similarities (S.F. Altschul et al., 1997, Nucleic Acids Res. 25:3389-3402). 
Furthermore, pattern-hit-initiated BLAST (PHI-BLAST) programs have been 
designed to identify specific patterns or sequence motifs shared by distantly-related 
proteins (Z. Zhang et al., 1998, Nucleic Acids Res. 26:3986-3990). Specialized 
BLAST programs are also available for performing searches of human, microbial, 
and malaria genome sequences, as well as searches for vector, immunoglobulin, and 
predicted human consensus sequences (National Center for Biotechnology 
Information (NCBI), Bethesda, MD). 

[00172] Both FASTA and BLAST programs identify very short exact 
sequence matches between the query sequence and the databases sequences, analyze 
the best short sequence matches ("hits") to determine if longer stretches of sequence 
similarity are present, and then optimize the best hits by dynamic programming (S.F. 
Altschul et al., 1990, J. Mol. Biol 215:403-410; W.R. Pearson, 1990, Methods 
Enzymol 183:63-98). In contrast, the SSEARCH program compares the query 
sequence to all the sequences in the database via pair-wise sequence comparisons 
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(T.F. Smith et al., 1981, J. Mol Biol. 147:195-197). Thus, the S SEARCH program is 
considered more sensitive than the BLAST and FASTA programs, but it is also 
significantly slower. The BLAST and FASTA programs utilize several 
approximations to increase their searching speed, and utilize statistical parameters 
(see below) to increase sensitivity and selectivity to approximate the performance of 
the S SEARCH program. 

[001 73] It is understood in the art that BLAST comparison of amino acid 
sequences requires a substitution matrix program (PAM30, PAM70, PAM120, 
PAM250, BLOSUM45, BLOSUM62, BLOSUM80, etc.; see, e.g., S.F. Altschul, 
1991, J. Mol Biol, 219:555-565; S.F. Altschul, 1993, J. Mol Evol 36:290-300; 
M.O. Dayhoff et al., 1978, Atlas of Protein Sequence and Structure, 5:345-352; R.M. 
Schwartz and M.O. Dayhoff, 1978, Atlas of Protein Sequence and Structure, 5:353- 
358; G.H. Gonnet et al., 1992, Science, 256:1443-1445; S. Henikoff and J.G. 
Henikoff, 1992, Proc. Natl Acad. ScL USA, 89:10915-10919; S. Henikoff and J.G. 
Henikoff, 1993, Proteins, 17:49-61). A substitution matrix program scores each 
possible amino acid residue substitution, reflecting the probability that the residue is 
related to the corresponding residue in the query. Although different substitution 
matrices use different scoring systems for quantifying the relationship between the 
compared residues, the results from different substitution matrices can be readily 
compared and evaluated by a person of average skill in the art. In fact, it is common 
practice to run several parallel BLAST searches, using a different substitution matrix 
for each search (S.F. Altschul, 1991, J. Mol. Biol, 219:555-565; S.F. Altschul, 1993, 
J. Mol Evol, 36:290-300; S. Henikoff and J.G. Henikoff, 1993, Proteins, 17(1):49- 
61). 

[00174] In some instances, BLAST analysis of amino acid sequences may 
require a filtering program (e.g., SEG or XNU). A filtering program removes 
repetitive regions from the query sequence (e.g., proline-rich regions). Although 
certain filtering programs determine regions of low compositional complexity (e.g., 
SEG), while other filtering programs determine regions with short, periodic, internal 
repeats (e.g., XNU), the results from different filtering programs are comparable and 
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can be evaluated by a person of average skill in the relevant art. Moreover, it is 
common practice to combine, alternate, or omit the filtering programs as required for 
a specific query sequence (see, e.g., B. Birren et al. (Eds), 1997, Genome Analysis: A 
Laboratory Manual, Volume 1: Analyzing DNA, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY). 

[00175] In addition, BLAST analysis may utilize different search parameters 
(e.g., scoring matrices and filters) depending on the specific features of the query 
sequence. For example, it is understood by the skilled practitioner that the BLOSUM 
matrices are not suitable for the shortest queries. Thus, the PAM matrices, such as 
PAM30 or PAM70, may be used for short query sequences (R.M. Schwartz and M.O. 
Dayhoff, supra; M.O. Dayhoff et al., supra). In general, as understood by the skilled 
person in the art, the following scoring matrices are recommended: 



Query length (amino acids) 


Substitution Matrix 


<35 


PAM30 


35-50 


PAM70 


50-85 


BLOSUM80 


>85 


BLOSUM62 



(see the National Center for Biotechnology Information website at Hyper Text 

Transfer Protocol ://World Wide Web .National Center for Biotechnology 

Information.National Library of Medicine.National Institutes of Health. 

GO V ernment/B asic Local Alignment Search Tool/matrix_info.Hyper Text Markup 

Language). 

[00176] Additionally, PAM 1 20 and BLOSUM62 matrices are designed to 
identify moderately diverged sequences, and these matrices may miss short/strong 
sequence similarities, or long/weak sequence similarities (S. Altschul et al., 1994, 
Nat Genet. 6: 1 19-129). For this reason, it has been generally recommended that 
researchers employ at least 3 separate substitution matrices for BLAST analysis (S. 
Altschul et al., 1993, J. MoL EvoL 36:290-300; S. Altschul et al., 1994, Nat. Genet. 
6:1 19-129). The use of different BLAST parameters with a particular query 
sequence is routinely practiced in the art and is well within the capability of an 



WO 03/035839 



-61- 



PCT/US02/34021 



average artisan (see, e.g., Altschul et al., 1994, Nature Genet, 6; 1 19-129; Altschul et 
al., 1996, Methods EnzymoL, 266:460-480; S. Henikoff and J.G. Henikoff, 1993, 
Proteins, 17(1):49-61). 

[00177] BLAST analysis of nucleic acid sequences generally require a unitary 
matrix program, which scores each position as "+" if the nucleotides match, or if 
the nucleotides do not match. In addition, BLAST comparisons of nucleic acid 
sequences may involve a nucleic acid-specific filtering program, such as DUST (J.M. 
Hancock and J.S. Armstrong, 1994, Comput. Appl. Biosci. 10:67-70). Notably, a 
particular sequence alignment program (e.g., BLAST, FASTA, or SSEARCH) and 
appropriate parameters (e.g., scoring matrices and filters) can be chosen based on the 
requirements of a sequence search, or individual preferences. In some cases, it may 
be preferable to use more than one program or set of parameters in order to confirm 
or evaluate search alignment results. 

[001781 The criteria for identifying a natural partner to a target using the 
computational approaches described above is defined as follows: a group of >5-7 
amino acids identical to a sequence found by the database searches. Alternatively, 
the finding that 50% or more of the hits in the top cohort are identical. In addition, 
the following information can be used to validate the target and natural partner for 
use in drug discovery paradigms: 

1 . Information about expression, disease indication, structural 
elements, etc. that relate the target to the putative natural partner. 

2. The ability to modulate the expected phenotype in the appropriate 
biological system. 

[00179] Proteome panning is particularly useful in drug discovery assays 
where target and target binder pairs are utilized in high-throughput screening. In 
particular, two ELISA methods may be used. The ELISA method I utilizes target 
phage and target binder phage, where each is labeled with a separate and distinct tag. 
The target phage is incubated onto a solid support, such as a plate previously coated 
with the target tag specific antibody. Addition of target binder phage and test 
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compounds from small molecule combinatorial libraries to the solid support follows. 
Any disruption of the target: target binder complex pairing thereby identifies test 
compounds or biologically active compounds that may bind to the target of interest. 
In order to ensure that there is no contamination, the ELISA method II utilizes pure 
target and target binders. This method requires subcloning, expression and 
purification of recombinant target protein, followed by the method of ELISA method 
L These methods may be preferred in that they can produce cleaner and more 
reproducible assays. 

[00180] Similarly, these methods may be adapted for microarrays, where target 
and target binder phages are incubated. Different fluorescent dyes are preferably 
used to distinguish between the target and the target binder. For example, Cy3 
identifies the target and Cy5 identifies the target binder. If a test compound binds to 
a target, then only the Cy3 signal will be emitted. Target and target binder pairs will 
be washed off the microarray except in conditions where the test compound binds to 
the target: target binder complexed pair, thereby emitting a combined Cy3/Cy5 
signal. This method may also be used to identify compounds binding to the target 
binder as well. 

[00181] A high-throughput screen developed by Perking Elmer (High- 
Throughput ALPHAScreen™) may be used with the proteome panning method of 
the invention. Briefly, target phage and target binder phage, each having a separate 
and distinct label or tag, are incubated with test compounds prior to the addition of 
acceptor and donor beads. In one example, the acceptor bead may specifically bind 
to the target as determined by the specific tag and the donor bead binds to the target 
binder, thereby forming two bead complexes: target: acceptor complex and target 
binder: donor complex. Another example may denote the target binding the donor 
bead and the target binder binding the acceptor bead. The two bead complexes may 
form a larger complex through binding the target and target binder, i.e. target: target 
binder complex. A signal is emitted once a target: target binder complex is formed. 
If a disruption of this target: target binder complex is observed, the specific test 
compound used identifies the biologically active compound. The microarray and 
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ALPHAScreen methods offer the advantage that they are simple, especially for 
analyzing a plurality of different test compounds at the same time. They also do not 
require additional steps such as phage separation, subcloning or purification. All four 
methods of using proteome panning for identifying biologically active compounds 
may be useful. 

[00182] All books, articles, and patents referenced herein are incorporated by 
reference intoto. The following examples illustrate various aspects of the invention 
and in no way intended to limit the scope thereof. 

EXAMPLES 

[00183] The examples as set forth herein are meant to exemplify the various 
aspects of the present invention and are not intended to limit the invention in any 
way. 

EXAMPLE 1 

CONSTRUCTION OF DGI-2-GENEIII FUSION PROTEINS IN pCANTABSE 
Creation of pCANTAB 5E Tetracycline resistant plasmid 

[00184] pCANTAB 5E with 2kb LacZ stuffer (DGI vector for RAPIDLIB) 
and pBR322 (Promega; Madison, WI), carrying the genes for tetracycline (Tet) and 
ampicillin (Amp) resistance were used to construct the pCANTABSE Tet resistant 
plasmid. pCANTAB (5 jag) was cut with Aat II (Base 65) and filled in with Pfii 
polymerase, followed by Seal cleavage at base 505. This removed the first 300 bases 
of the Amp resistance gene and its entire promoter. The vector was purified on Low 
Melt agarose. 

[00185] pBR322 (5 ^ig) was cut with Ava I (base 1 425) and filled in with Pfii 
polymerase, and then cut with Aat II at base 4284. This removed 1500 bases 
containing the entire Tet resistance gene. The fragment was purified on Low Melt 
agarose. 
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[00186] The fragment and vector were ligated and plated on Tet ( 1 O^g/ml) 
plates. Colonies were picked and replica plated on both Amp and Tet plates. 

[00187] Five colonies that were Tet resistant and Amp sensitive were grown in 
Tet media. The colony that grew the best was used, Maxi-prepped and labeled 
pCANTAB Tet. 

EXAMPLE 2 

CONSTRUCTION OF A BLUNT-ENDED CDNA LIBRARY 
IN TETRACYCLINE PHAGE 
[00188] Liver tumor HepG2 (hepatome) Lambda Uni_Zap XR (Stratagene, 
Cat. No. 937261) was used for constructing a cDNA library in a modified pCANTAB 
5E Tet resistant vector. The library mass (350-1500bp) was excised and cut with 
EcoRV and Smal, creating two sets of blunt end fragments. The Tet vector was 
modified by inserting a PCR fragment with Sfil and NotI ends. The PCR fragment 
created EcoRV sites inside of the Sfil and NotI sites. The modified vector was cut 
with EcoRV and gel purified using low melt agarose. The mass of the excised blunt 
end fragments was less than 1 KB cut out of the gel and purified. The insert was 
ligated to the blunt ended Tet phage vector. Additional guidance is commonly 
known in the art (Savinov and Austin. Comb. Chem. High Throughput Screen, 
4:593-7, 2001; Zucconi, et al. J. Mol Biol. 307:1329-39, 2001; Crameri and 
Kodzius. Comb. Chem. High Throughput Screen. 4:145-55, 2001; Mazzarelli and 
Ricciardi. Biotechniques. 30:380-6, 2001; Sihoud, et al. Int. J. Mol. Med. 6:123-8, 
2000; Cochrane et al. J. Mol. Biol. 297:89-97, 2000; and Santi, et al. J. Mol. Biol. 
296:497-508, 2000). 

[00189] In addition to the HepG2 library, subtracted cDNA libraries are used 
as the starting point for this process. In this case, the subtracted libraries are ligated 
into the Tet vector as above. The protocol that is used for subtraction is described by 
Clontech (Clontech PCR-Select Subtraction). This protocol is applicable to any 
library, such as but not limited to ovarian, breast, and colorectal cancer. This 
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approach is used for developing subtracted libraries for other disease indications 
including CNS, cardiovascular, angiogenesis, etc. 

EXAMPLE 3 
PRODUCTION OF TARGET PHAGE 
[00190] Phage rescue was carried out to produce phage with displayed target 
protein. Overnight cultures of TGI cells (F' traD36 lacP A(/acZ)M15 proAB) / supE 
A(hsdM-mcrB) s r k "m k " McrB) thi A{lac-proAB ) harboring the target protein -gene III 
gene fusion plasmids (2XYT-TG) were inoculated 1:100 into 10 ml of 2XYT-TG 
media. The cultures were grown to an OD 600 0.5 in 2xYT-TG at 30 °C with shaking 
(250 rpm). M13K07 helper phage was then added [Multiplicity of Infection (MOI) = 
15], and the cells were incubated for 1 hr at 37°C with shaking (250 rpm). Following 
infection, cells were pelleted and the supernatant containing the helper phage was 
discarded. The cell pellet was resuspended in the initial culture volume of 2xYT 
without glucose and containing tetracycline and 50 mg/mL kanamycin (2xYT-TK) 
and grown overnight at 30 °C with shaking (250 rpm). The cells from the overnight 
culture were pelleted at 3000 x g for 30 min at 4 °C and the supernatant containing 
the phage was recovered. The phage were titered by infecting TGI cells and plating 
on tetracycline plates. 

EXAMPLE 4 
PRODUCTION OF TARGET BINDER PHAGE 
[00191] The target binder library, 20F, was constructed by creating DNA 
fragments coding for peptides containing 20 random amino acids using PCR and 
synthetic oligonucleotides. An oligonucleotide was synthesized containing the 
sequence (NNK) 20 where N = A, C, T, or G and K = G or T. This oligonucleotide 
was used as the template in PCR reactions along with two shorter oligonucleotide 
primers, both of which are biotinylated at their 5' ends. The resulting PCR product 
was purified and concentrated (followed by digestion with Sfil and NotI). The 
resulting fragment was purified and the phagemid pCANTABSE (Pharmacia) was 
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digested with Sfil and NotL The digested DNA was resolved using a 1% agarose gel, 
excised and purified by QIAEX II treatment (Qiagen). The vector and insert were 
ligated overnight at 15°C. The ligation product was purified and electroporations 
were performed at 1500 V in an electroporation cuvette (0.1 mm gap; 0.5 ml volume) 
containing 12.5 g DNA and 500 L of E. coli strain TGI electrocompetent cells. 
Immediately after the pulse, 12.5 ml of pre- warmed (40°C) 2x YT medium 
containing 2% glucose (YT-G) was added and the transformants grown at 37°C for 
one hour. Cell transformants were pooled, the volume measured and an aliquot 
plated onto 2x YT-G containing 10^g/ml ampicillin (YT-AG) to determine the 
number of transformants. The diversity of the random 20-mer peptide cell library was 
found to be > 1.1 X 10 11 clones. Sequencing of randomly selected clones for the 
library revealed that about 77% of the clones were in frame. In addition, the E-tag 
epitope (GAPVPYPDPLEPR; SEQID NO: 222) was engineered into the carboxy 
terminus of the peptide. 

EXAMPLE 5 
PROTEOME PANNING WITH DGI-2 
[00192] For these studies, a phage library 2 OF, which expresses random 
peptides 20 amino acids long and has Ampicillin (Amp) resistance, and a phage that 
express the target of interest, specifically DGI-2, and has Tetracycline (Tet) 
resistance were used. One hundred microliters of 2.4 x 10 10 CFU (colony forming 
units) phage library 20F with a diversity of >10 u independent clones and a total 
concentration of >10 13 clones/ml was mixed with 100 microliters of 1.4 x 107 CFU 
(colony forming units) phage expressing DGI-2, the target of interest, and incubated 
at room temperature for 2 hours. Dilutions were made ranging from 10 2 - 10" 12 
(including individual inputs for each library). Each dilution was plated on one of 
each Amp (50 jag/ml), Tet (10 |ag/ml), and Amp (50 ^ig/ml) + Tet (10 |ag/ml) plates 
and incubated at 37°C overnight. 

[00193] On day 2, the number of colonies that grew on all of the plates were 
counted and the titers were recorded. Ninety-six individual clones were picked from 
Amp + Tet plates and inoculated into 5001 of 2xYT media containing 40% glucose 
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and Amp (50 fag/ml) + Tet (10 ^ig/ml) and incubated at 37°C shaking 225 rpm 
overnight. 

[00194] On day 3, 40 microliters of the Amp + Tet resistant clones were 
inoculated into 400 microliters of Amp only media to allow cells to selectively lose 
the Tet phagemid (expressing DGI-2) by removing the Tet selection pressure and 
retain the Amp phagemid (expressing DGI-2 binder peptide). Cells were incubated at 
37°C shaking at 250 rpm overnight. 

[00195] On day 4, Amp phage (DGI-binder phage) were rescued by 
inoculating 40 microliters of cells from day 3 in 400 microliters 2xYT media 
containing 40% glucose, 5Q^g/ml Amp, and M13K07 helper phage containing 
kanamycin (kan) resistance (kanR) marker. Cells were incubated for 3 hours at 37°C 
with shaking 250 rpm. Following infection, cells were pelleted for 15 minutes at 
3500 rpm at 4°C, supernatant removed, and the pellets were resuspended in the initial 
culture volume of 2x YT media containing 50t£/ml Amp and 5Q<g/ml Kan. Cells 
were grown overnight at 37°C with shaking at 225 rpm. 

[00196] On day 5 , cells from the overnight culture were pelleted and Amp 
resistant phage (DGI-2 binder phage) recovered in the supernatant. 

Proteome Panning in the Presence of Non-specific Phage 

[00197] In order to validate the Proteome panning method, non-specific and 
unrelated phage is used. Phage clone(s) expressing peptides that bind specifically to 
the target of interest are rescued and amplified. These phage clone(s) are then mixed 
with unrelated phage clone(s) that do not bind the target. By proteome panning of 
the mixed phage population, only the binder phage and not the unrelated phage is 
isolated. This indicates that entry of the peptide-expressing phage into cells is 
dependent on interaction with the target-expressing phage. 
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EXAMPLE 6 

EXPRESSION OF THE DGI-2 PROTEIN FOR ELISA ANALYSIS 
[00198] To verify that the selected target binders bound to DGI-2, Elisa assays 
were performed with recombinant DGI-2. DGI-2 was subcloned and expressed using 
conventional methods as described below. 

[00199} An IMAGE clone containing the DGI-2 sequence (IMAGE: 3838690; 
accession number BE748524) was obtained from the American Type Culture 
Collection (ATCC, Manassas VA). The clone included 770 bp and encompassed the 
entire DGI-2 open reading frame (ORF). 

[00200] The 336 bp DGI-2 ORF was amplified using PCR, the amplified 
sequence was digested with the appropriate enzymes, the digested DNA was ligated 
into the appropriate expression vectors, and the cloned DNA was sequenced. DGI-2 
was first expressed in E. coll The DGI-2 ORF was cloned into expression vector 
pTrcHis A (Invitrogen, Carlsbad, CA) using restriction sites Kpn I and Hind III, 
which were added to the DGI-2 by PCR. The primers used for PCR amplification are 
shown as follows: 

[00201] DGI-2 Kpn I forward primer for the TrcHis A vector: 5 GGA TCC 
GGG TAC CAT GGT GCG GAC TAA AGC AGA CAG -3' (SEQ ID NO: 223); 

[00202] DGI-2 Hind III reverse primer for the TrcHis A vector: 5 GGC 
CAG AAG CTT TTC TTT TTC ATC ATT TGT GTG ATC AGG -3' (SEQ ID NO: 
224); 

[00203] Protein yields were satisfactory, but most of protein was found in 
inclusion bodies. Next, DGI-2 was expressed using an in vitro translation system 
(Rapid Translation System RTS 5000, Roche Diagnostics GmbH, Mannheim 
Germany). For this system, the DGI-2 ORF was cloned into the pIVEX2.3-MCS 
vector (Roche Diagnostics) using restriction sites Nde I and Hind III, which were 
added to DGI-2 by PCR. In this case, the protein was found to be soluble, and was 
used for panning and ELISA analysis. The primers used for the PCR amplification 
were as follows: 
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[00204] DGI 2 Nde I forward primer for the IVEX vector:5'- GAT CTA CAT 
ATG GTG CGG ACT AAA GCA GAC AG c-3' (SEQ ID NO: 225). 

[00205] DGI 2 Hind III reverse primer for the IVEX vector: 5'- GAT GGC 
CAG GTC GAC TTC TTT TTC ATC ATT TGT GTG ATC AGG-3' (SEQ ID NO: 
226). 



EXAMPLE 7 
ELISA ANALYSIS OF PHAGE 
[00206] For analysis of individual target binders, colonies were picked and 
phage prepared as described above using helper phage, M13K07. Ninety-six-well 
microtiter plates were coated with 500 ng/well of specific target protein, 500 ng/well 
lactate dehydrogenase (LDH) irrelevant or non-specific target, 500 ng/well of 
positive control antibody raised against ETAG, and negative control of non-fat milk 
powder-PBS 2% at 4°C overnight. Plates were then blocked with 350 
microliters/well PBS containing 2% non-fat milk for 1 hr at room temperature. One 
hundred microliters per well of phage from isolated clones were added and plates 
were incubated at room temperature for 2 hr after centrifuging samples for 20 
minutes at 3500 rpm at 4°C. The phage solution was then removed, and the wells 
were washed three times with PBS at room temperature. Antibody raised against 
Ml 3 conjugated to horseradish peroxidase (Pharmacia) was diluted 1:3000 in PBS- 
Tween and added to each well (100 |al/well). Incubation was for 1 hr at room 
temperature, followed by washing with PBS-(0.5%) Tween 3 times as described. 
Color was developed by addition of ABTS solution (100 |il/well; Boehringer, 
Indianapolis, IN). Plates were analyzed at 405 nm. A clone was considered 
"positive" if the A 405 of the well was > 2-fold over background. 
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EXAMPLE 8 
DGI-2 PANNING RESULTS 
[00207] Overall, over 1 00 clones from the DGI-2 proteome panning and 
conventional panning experiments were sequenced. One hundred eighty-nine (189) 
unique peptides were identified (see Table 1), including 51 high-specificity binders 
(DGI-2-binders; SEQ ID NOs: 1-51), 1 1 weak binders (SEQ ID NOs: 52-62) and 2 
potential binders (SEQ ID NOs: 63-64) as determined by proteome panning; and 84 
high-specificity DGI-2 binders (SEQ ID NOs: 65-148), 12 weak binders (SEQ ID 
NOs: 149-160), and 29 potential binders (SEQ ID NOs: 161-189) as determined by 
conventional panning. Binders preferably have a binding ratio of specific phage to 
non-specific phage of >2.0, that is twice the background, weak binders are defined 
herein as those having a ratio ranging between 1.5 and 2.0, and potential binders have 
a binding ratio of <1 .5. Each of the high-specificity DGI-2-binders were isolated 
multiple times. Table 1 shows the encoded amino acid sequences and results for each 
of the clones panned against DGI-2. 

[00208] Further analysis of the DGI-2 binders resulted in the classification of 
several potential sequence motifs. Four motifs were identified as potential motifs for 
DGI-2 binders (see Figure 1). Motif 4 was found in clones derived from a 
conventional panning and proteome panning method indicating that similar peptides 
that potentially bind to the same site of DGI-2 were identified by conventional 
panning and proteome panning methods. Identifying sequence motifs provides a 
better understanding of the structure and function of the target as well as the search 
for potential target binders and natural partners. 
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EXAMPLE 9 
DATABASE ANALYSIS OF DGI-2 BINDERS 
[00209] DGI-2 binders that were identified and shown to bind specifically to 
DGI-2 are used to identify DGI-2-partners. A "Phenogenix" approach is used as 
described in co-owned patent applications, U.S. Patent Application Serial No. 
09/852,455 and International Patent Application No. PCT/US01/15092, which are 
incorporated herein by reference. These methods employ several different database 
search programs. Initially, the entire peptide sequence and consensus motifs of the 
DGI-2-binders are entered into an Advanced BLAST search (Hyper Text Transfer 
Protocol ://World Wide Web .National Center for Biotechnology 
Information.National Library of Medicine.National Institutes of 
Health.Government/Basic Local Alignment Search Tool/Basic Local Alignment 
Search Tool.Common Gateway Interference?Jform=l) using the following 
parameters: 

1. Programs: BLASTP, TBLASTN; 

2. Databases: protein and nucleotide databases including dbest (ESTs), 
dsts (STSs), and htgs (unfinished high throughput genomic sequences); 

3. Expect value: 1000 to 20000; 

4. Matrix: PAM30 or PAM70; and 

5. Query: consensus motif alone and varying combinations of sequence 
at the N- and C-terminal ends. 

[00210] In subsequent steps, motifs identified by sequence alignment 
programs like MEME (Multiple EM for Motif Elicitation), (Hyper Text Transfer 
Protocol://Multiple EM for Motif Elicitation.San Diego Supercomputer 
Center.Education/Multiple EM for Motif Elicitation/website/intro .Hyper Text 
Markup Language) were also used to search the available databases using MAST 
(Motif Alignment and Search Tool, Hyper Text Transfer Protocol ://Multiple EM for 
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Motif Elicitation. San Diego Supercomputer Center.Education/mem/website/mast- 
intro.Hyper Text Markup Language). Motifs and consensus domains were further 
used as query patterns to search the protein databases using Patternfind (Hyper Text 
Transfer Protocol ://World Wide Web.isrec.isb-sib.ch/software/PATFND_ 
form.Hyper Text Markup Language). 

[00211] For Patternfind, the following parameters are used: 

1. Databases: non-redundant, Swissprot, TREATS, and TROGON; 

2. Limit: between 10 and 5000; and 

3. Query: consensus motif alone and varying combinations of sequence 
at N- and C-terminal ends. 

[00212] Data obtained from the various searches are analyzed using the 
following approach: 

1 . Results of different searches are analyzed independently and then 
combined to search for similar classes of proteins (e.g., nucleic acid binding 
proteins, kinases) that emerged; 

2. The best matches identified in more than one kind of search (e.g. 
same protein/ORF picked up by BLAST searches using different parameters, or by 
both BLAST and Patternfind) are picked. The amino acid sequence of the protein in 
the identified region is compared with other peptide surrogates containing this 
amino acid sequence; and 

3. The protein interaction is evaluated in view of the cellular function of 
the target, DGI-2. 

[00213] The criteria for identification of a DGI-2-partner (i.e., partner hit) 
include the following: 

1 . Search produces an exact match of > 5-7 amino acids; 
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2. Same partner appears in at least 50% of the top hits of any one 
search, and/or the same or related hits occur in multiple searches; 

3. Search produces expected class of protein partners based on function, 
cellular location, or tissue/disease distribution; and 

4. Partner candidate produce a phenotype change when added into the 
appropriate model system. 

[00214] Unless there is an exact sequence match (criterion 1), the candidate 
partner is required to satisfy at least two other criteria to be considered a partner hit. 

EXAMPLE 10 

PANNING FOR HIGH AFFINITY BINDERS AND MOTIFS 
[00215] It may be expected that, in the case of using a cDNA library as the 
target, only one target binder will be found per target. Also, a consensus or motif 
may not be obvious in the absence of multiple binders to a target. The methods 
described here allow one to identify multiple target binders per target as well as 
motifs and consensus domains on the binders. Following proteome panning and 
identification of the target and target binders using the computational methods 
described above, each clone is grown in tetracycline for two cycles (48 hours) to 
eliminate the ampicillin resistant phage expressing the target binders. Each target 
phage (tet resistant) is then rescued with helper phage and subjected to an additional 
round of proteome panning versus the library of target binders. As an example, if 
100 colonies are picked from the first round of proteome panning and grown in 
tetracycline, then 100 of the Tet resistant phage are mixed with the original random 
library (including but not limited to RAPIDLIB20 or RAPIDLIB40, etc). This 
process may be performed in a 96 or 384 well format by mixing target and target- 
binder libraries at a ratio of 1 : 1 . Conditions for the proteome panning step are based 
on conditions affecting the ability to generate high affinity pairs such as 
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temperature, pH, ionic strength and additional washing steps with different buffers. 
After 2 hours, the target/target-binder phage are used to infect the host cells as 
described above. After plating, 100-500 colonies are picked, sequenced and 
subjected to computational analysis as described. It is expected that there will be at 
a minimum of one motif per target which may be used as the basis for generating 
secondary libraries to obtain the appropriate affinity, selectivity and potency. 

EXAMPLE 11 
PANNING WITH DGI 2, HRAS AND LEPTIN LIGAND 
[00216] One hundred microliters of 3 .6 x 1 0 13 CFU (colony forming units) 
per ml random peptide phage libraries (two peptide libraries were mixed - a 
constrained 25 amino acid peptide library, 20C, and a random 20 amino acid library, 
20R) was mixed with 100 microliters of either 3.3 x 10 8 CFU (colony forming units) 
per mL phage expressing DGI-2, 4.5 x 10 8 CFU (colony forming units) per mL 
phage expressing Hras, or 2.7 x 10 8 CFU (colony forming units) per mL phage 
expressing Leptin ligand, and incubated at room temperature for 1 hour. Dilutions 
were made ranging from 10' 1 - 10" 6 . One hundred microliters of each dilution were 
used to infect competent E. coli TGI cells and plated on Ampicillin (Amp; 50 
jag/ml) + Tetracycline (Tet; 10 |ig/ml) plates. Plates were incubated at 37°C 
overnight. 

[00217] On day 2, the number of colonies that grew on all of the plates were 
counted and the titers were recorded. Ninety-six individual clones were picked from 
Amp + Tet plates and inoculated into 5001 of 2xYT media containing 40% glucose 
and Amp (50 ^ig/ml) + Tet (10 fig/ml) and incubated at 37°C shaking 225 rpm 
overnight. 
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[00218] On day 3, 40 microliters of the Amp + Tet resistant clones were 
inoculated into 400 microliters of Amp only media to allow cells to select out the 
Tet phagemid (expressing DGI-2, Hras, or Leptin ligand) by removing the Tet 
selection pressure and retaining the Amp phagemid (expressing peptide binder). 
Cells were incubated at 37°C shaking at 250 rpm overnight. 

[002191 On day 4, Amp phage (binder phage) were rescued by inoculating 40 
microliters of cells from day 3 in 400 microliters 2xYT media containing 40% 
glucose, 50g/ml Amp, and M13K07 helper phage containing kanamycin resistance 
marker. Cells were incubated for 3 hours at 37°C with shaking 250 rpm. Following 
infection, cells were pelleted for 15 minutes at 3500 rpm at 4°C, supernatant 
removed, and the pellets were resuspended in the initial culture volume of 2x YT 
media containing 5Qxig/ml Amp and 50Xig/ml Kan. Cells were grown overnight at 
37°C with shaking at 225 rpm. 

[00220] On day 5, cells from the overnight culture were pelleted and Amp 
resistant phage (binder phage) were re-infected into competent E. coli TGI cells. 
Each clone was plated on a separate plate, individual colonies were picked from 
each plate and sequenced after amplification by PCR. 

[00221] This method resulted in a good separation of the two interacting 
phage and allowed for sequencing and further analysis by ELIS A where target 
protein was available. 

EXAMPLE 12 
PANNING OF cDNA LIBRARY (cDNALIB) 
[00222] One hundred microliters of 1 .8 x 1 0 13 CFU (colony forming units) 
per ml of random peptide phage libraries (two peptide libraries were mixed - a 
constrained 25 amino acid peptide library, 20C, and a random 20 amino acid library, 
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20R), or 1.4 x 10 12 CFU (colony forming units) per mL cDNALIB (amp) library, or 
2.3 x 10 11 CFU (colony forming units) per mL cDNALIB (amp) library, or 7.8x 10 12 
CFU (colony forming units) per mL of random peptide phage libraries (random 40 
amino acid, RAPIDLIB), was mixed with one hundred microliter of 7.0 xl0 8 CFU 
per mL cDNALIB (tet), and incubated at room temperature for two hours. Dilutions 
were made ranging from 10" 1 - 10' 6 . Fifty microliters of each dilution was used to 
infect competent E. coli TGI cells and plated on Amp (50 fig/ml) + Tet (10 |ig/ml) 
plates. Plates were incubated at 37°C overnight. 

[00223] On day 2, the number of colonies that grew on all of the plates were 
counted and the titers were recorded. Ninety-six individual clones were picked from 
Amp + Tet plates and inoculated into 4001 of 2xYT media containing 40% glucose 
and Amp (50 |xg/ml) + Tet (10 jxg/ml) and incubated at 37°C shaking 225 rpm 
overnight. 

[00224] On day 3, 40 microliters of the Amp + Tet resistant clones were 
inoculated into 400 microliters of media containing either Amp ( 504&tol) or Tet (50 
^(g/ml) to allow cells to select out either the Tet or the Amp phagemid respectively, 
by removing selection pressure for one of the two antibiotic markers. Cells were 
incubated at 37°C shaking at 250 rpm overnight. 

[00225] On day 4, phage were rescued by inoculating 40 microliters of cells 
from day 3 in 400 microliters 2xYT media containing 40% glucose, SO^g/ml Amp or 
Tet, and M13K07 helper phage containing kanamycin resistance marker. Cells 
were incubated for 3 hours at 37°C with shaking 250 rpm. Following infection, 
cells were pelleted for 15 minutes at 3500 rpm at 4°C, supernatant removed, and the 
pellets were resuspended in the initial culture volume of 2x YT media containing 50 
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g/ml Amp or Tet and 50<g/ml Kan. Cells were grown overnight at 37°C with 
shaking at 225 rpm. 

[00226] On day 5, cells from the overnight culture were pelleted and Tet 
(target) or Amp (binder) resistant phage were re-infected into competent E.coli TGI 
cells. Each clone was plated on a separate plate, individual colonies were picked 
from each plate and sequenced after amplification by PCR. 

[002271 This method resulted in a good separation of the two interacting 
phage and allowed for sequencing and further analysis. 

EXAMPLE 13 

MODIFIED PANNING OF cDNA LIBRARY (cDNALIB) 
[00228] For proteome panning, a tetracycline resistant cDNA phage displayed 
library (Tet R phage) was mixed with multiple ampicillin resistant RAPIDLIB™ 
phage displayed peptide libraries (Amp R phage) in 10ty<L volume. The amount of 
Tet R phage particles was equal to the amount of Amp R phage and the total number of 
phage particles used was about 5 x 10 8 . 

[00229] The phage mixture was slowly rotated for 2 hours at room 
temperature (RT; about 25°C) and then used to infect log phase E.coli TGI (OD 600 ~ 
0.8 - 1.0; ~ 6 x 10 8 cfu/mL) at a phage-to-cell ratio of 1 : 25-1000, and preferably a 
low phage-to-cell ratio of 1 : 100-500. Low cell-to-phage ratio is necessary to 
minimize the infection of cells by non-paired Tet R phage and Amp R phage. Cells 
were incubated for 30 min at 37 °C with gentle shaking to allow infection to occur 
and then separated from medium by centrifugation at 1000 x g for 10 min. Then 
infected cells were resuspended in small volume of 2xYT medium and plated onto 
2xYT agar medium containing 2% glucose, lOQig/mL ampicillin and l£<g/mL 



WO 03/035839 



PCT/US02/34021 



-89- 



tetracycline (2xYT-GAT) to obtain individual colonies. Plates were incubated at 30 
°C overnight. Grown colonies were counted and used for recovery and separation of 
paired Tet 11 phage and Amp R phage. 

[00230] Separation of paired Tet 1 * phage and Amp R phage was done by 2 
consequent phage recoveries using modified phage rescue technique. Bacterial 
colonies bearing paired Tet R phage and Amp R phage were grown in 96 well cluster 
tube boxes overnight at 30 °C, 250 rpm, in 2xYT/ampicillin (100 
>ug/mL)/tetracycline (15xig/mL) medium containing 0.1% glucose (2xYT-0.1G/AT). 
Overnight cultures then were used to inoculate two sets of cluster tubes with phage 
rescue media (2xYT-0.01% glucose medium containing M13K07 helper phage (~ 4 
x 10 9 cfu/mL) and either ampicillin or tetracycline) with dilution rate 1:10. After 2 
- 2.5 hours of growth at 37 °C 5 250 rpm, kanamycin was added to final 
concentration 5Qcg/mL and infected bacteria were incubated overnight for Tet R 
phage and Amp R phage amplification. 

[00231] The following day phage-containing supernatants cleared of bacteria 
by centrifugation (1000 g 9 10 min) were diluted in 2xYT medium (at the rates 1 : 10 6 
for Amp R phage and 1 : 10 5 for Tet R phage) and IQ£, of dilutions were used to infect 
10<^ of log phase E. coli TGI cells. Cells were incubated at 37 °C, 150 rpm, for 1 
hour for efficient infection followed by addition of 2xYT-0.1%G/AT. Cells were 
grown overnight at 30 °C, 250 rpm, and obtained cultures were used for second 
round of phage amplification. 

[00232] After second phage rescue, recovered Amp R phage were analyzed by 
ELIS A in order to detect the expression of phage-displayed peptides. Amp R phage 
with the strongest signals over background (equal or above 2x) as well as 
corresponding Tet R phage were diluted and used for infection of TGI cells as 
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described above. Cells were plated onto 2xYT-2% glucose medium containing 
appropriate antibiotic and incubated overnight at 30 °C. Grown colonies were used 
for PCR and subsequent sequence analysis. 

EXAMPLE 14 
PRODUCTION OF cDNA 
[00233] In order to produce cDNA, three methods are utilized: 1 ) random 
priming and directional cloning; 2) PCR-based oligo dT priming and directional 
cloning; and 3) excision of cDNA from existing libraries. 

Random Priming and Directional Cloning 

[00234] Ligation-ready subtracted or unsubtracted cDNA was produced by 
classical cDNA methods using random priming and poly AH- RNA from various 
normal and cancerous tissues. Priming the RNA was carried out by a random 
pentamer (5'-GCTCGCCCTCGGCGGCCGCNNNNNT-3'; SEQ ID NO: 191) at 
various concentrations which controls the length of the cDNA to obtain more 
positive clones in the expression screen to the preferred range of 200bp to 2,000bp . 
In order to carry out directional cloning, the random primer contained the restriction 
site Not I 5' to the random pentamer sequence. The poly A+ RNA (2 g) for this 
reaction was purchased, isolated from particular cell lines or tissues, or in the case of 
subtracted libraries the "target specific RNA" (10/J of the 2(^J), obtained from the 
multi- round subtractive hybridization protocol (see below). 

[00235] First stand synthesis was performed by combining the RNA and the 
random pentamer (^g) primer to a final volume of^uh If the final volume exceeded 
(as in the case of the subtracted mRNA), the reactions that followed were scaled 
up. The tube containing the combined RNA and random pentamer was incubated at 
72°C for 2 minutes in order to denature the RNA and primer. The tube was then 
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cooled by placement on ice for 2 minutes. Collection of the contents of the tube was 
performed by centrifuging at 14,000 rpm for 20 sees in a microcentrifuge. For the 
first strand reaction to occur, 2^1 (5 X First Strand Buffer), 1^1 dNTP (10 mM), and 1 
( \ AMV Reverse Transcriptase (20 units /^l) were added to the contents of the tube. 
The primers were extended at 25°C for 10 min. The contents of the tube were then 
mixed gently and incubated at 42°C for 1.5 hour. To carry out the second-stand 
cDNA synthesis, 1^1 (5X 2 nd strand buffer, 1.6^1 dNTP (lOmM), 4.tyJ (20X second- 
strand cocktail) and 48.4 1 depc-H 2 0 were added on ice to a final volume of 80^1. 
Incubation of the reaction tube occurred for 2 hours at 16°C. After the incubation,^ 
(6 units) of T4 DNA polymerase was added to the reaction tube and incubated again 
at 16°C for 30 min. After the incubation 4*1 of 20 X EDTA/ glycogen mixture was 
added to the tube. Phenol extraction and precipitation of the cDNA was performed 
with isopropanol/sodium acetate and then phosphorylated. The kinase reaction 
involved adding 5^1 of T4 Kinase (10 units^l), 1Q*1 (10X T4 DNA ligase buffer), and 
water to the cDNA resulting in a final volume of 100^1. The reaction tube was then 
incubated at 37°C for 1 hour. After incubation, phenol extraction, precipitation, and 
quantitation was followed by the Picogreen method (Molecular Probes; Eugene, 
OR). The final digestion was carried out at the common Not I site (derived from the 
random primer), which provided for the directional cloning of the fragments. The 
digestion with the Not I enzyme was performed in a total volume of 100^1 for 4 
hours at 37°C Buffer 3 (NEB). Following enzyme digestion, the cDNA was 
chloroform extracted, precipitated and resuspended in water. To carry out the 
expression screen, the cDNA derived from each tissue was ligated into the Sma I- 
Not I sites of the green fluorescent protein (GFP) vector or the Kan (amp or tet) 
vector Frames 1, 2, and 3 using a molar ratio of 1 : 10. 
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PCR-Based Oligo dT Priming And Directional Cloning 
[00236] Ligation-ready subtracted or unsubtracted cDN A from various 
normal and cancerous tissues for directional cloning was produced by the Smart 
cDNA Synthesis Kit (Clontech) with some modifications. The poly A+ RNA (l^g) 
for the reaction was either purchased, isolated from particular cell lines or tissues, or 
in the case of subtracted libraries the "target specific RNA" used (104 of the 20/$) 
was obtained from the multi- round subtractive hybridization protocol (see below). 
To carry out the first stand synthesis, the RNA, the 3' SMART CDS Primer IIA (10 
M) and the modified 5' Smart Primer (10/jVl) were combined for a final volume of 5 
1. Six versions of the modified 5' Smart Primer were used, each containing one of 3 
reading frames and two sequence variations to provide for differential sequencing by 
PCR. If the volume exceeded ^1 (as in the case of the subtracted mRNA), the 
reactions that followed were scaled up. The reaction tube was incubated at 72 °C for 
2 minutes to denature the RNA and primer. The tube was placed on ice for 2 
minutes. The reaction tube was centrifiiged for collection of the contents. The 
addition of 2^1 (5 X First Strand Buffer), 1^1 DTT (20 mM), 1^1 dNTP (10 mM), and 
1^1 Power Script Reverse Transcriptase to the 5/4 volume provided for the first strand 
reaction. The contents of the tube were mixed gently and incubated at 42°C for 1 hr 
and then placed on ice to terminate the reaction. PCR amplification produced 
double stranded cDNA by reacting fyl of the subtracted mRNA first strand reaction 
or O.^J un-subtracted mRNA first strand reaction with the PCR reaction containing 
ig C( l(10X Advantage 2 Buffer), 2^1 (50X dNTPs), 44 modified Smart Primer, 4^5' 
PCR primer, and^al Advantage 2 Enzyme in a total volume of 10(^J. For the PCR 
amplification rounds, one preliminary cycle (95°C, 1 min) and then 24 cycles were 
carried out ( 95°C 5 sec, 65°C 15 sec, 68°C 6 min). The resulting cDNA was column 
purified using QIAquick™ columns (Qiagen) and quantitated by Picogreen method 
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(Molecular Probes; Eugene, OR). In order to obtain preferred size fragments (lOObp 
to 2000bp) for insertion into the phage display vector, the initial long cDNA 
fragments were digested with multiple enzymes having known infrequent cutting 
profiles in the mammalian genome. The digestions were conducted in the following 
manner. For each digestion, S^jg of the double stranded cDNA PCR product was 
used, each enzyme was added in a volume of ^J, and 10 X Buffer (1^1) and 100X 
BSA were added. The total volume for each reaction was 150^1. The digestions 
were as follows: 



Digestion Reaction 1 containing enzymes Nar I, Nae I, and Avr II and NEB 
Buffer 1 was carried out for 4 hours at 37°C. 

Digestion Reaction 2 containing enzymes EcoRl, Hind III, and BbvC I and 
NEB Buffer 2 was carried out overnight at 37°C. 

Digestion Reaction 3 containing enzymes Bam HI, Xba I, and Ssp I and 
NEB Buffer 2 was carried out for 4 hours at 37°C. 

Digestion Reaction 4 containing enzymes Bgl II, Bgl I, Xho I and NEB 
Buffer 3 was carried out for 4 hours at 37°C. 

Digestion Reaction 5 containing enzymes Sac I, Sph I, Dra I and NEB 
Buffer 4 was carried out overnight at 37°C. 



Digestion Reaction 6 containing enzyme Rsa I and NEB Buffer 1 was 
carried out for 4 hours at 37°C 
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[00237] The DNA in each digestion reaction was extracted by chloroform and 
precipitated with isopropanol/sodium acetate. The 5' and 3 5 overhangs were filled in 
by the Pfu DNA Polymerase polishing reaction. The precipitated DNA was 
resuspended in 2(^(1 water and the polishing reaction was carried out in a total 
volume of lOt^l containing 1(^1 (10 X Buffer), lQaJ (lOmM dNTPs), and 10^ Pfu 
DNA Polymerase (2.5units^l). The reaction was incubated at 72 °C for 30 minutes 
after which the cDNA was then phenol-chloroform extracted, precipitated and 
resuspended in 5^u} water. The final digestion was carried out at the common Sfi I 
site (derived from the modified Smart Primer; DGI), which provided for the 
directional cloning of the fragments. The digestion with the Sfi I enzyme was 
carried out in a total volume of KXfcJ for 4 hours at 50°C Buffer 2 (NEB). DNA 
quantitation was performed by the Picogreen method. The cDNA was ligated into 
the Sfi-Sma site of either the GFP vector or the Kan vector for the expression screen 
using a molar ratio of 1 : 10. 

Excision Of cDNA From Existing Libraries 

[00238] Pre-existing cDNA libraries from normal and cancerous tissues were 
excised at the appropriate restriction sites. This method entailed preparing and 
digesting the plasmid following manufacturers * instructions (Qiagen). After the 
digestion was complete, the DNA analysis was performed on a 1% agarose gel. The 
gel portion containing the resultant DNA smear was extracted by removing the 
DNA smear. Purification by commercial or known methods (QIAquick) was 
followed by optical density quantitation. In order to obtain preferred size fragments 
(lOObp to 3000bp, more preferably 200bp to 2000bp) for insertion into the phage 
display vector, the initially long cDNA fragments were digested with multiple 
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enzymes having known cutting profiles in the mammalian genome. The digestions 
were conducted in the manner described above for the PCR-based oligo dT Priming. 
To carry out the expression screen, the cDNA derived from each tissue was ligated 
into the Sma I - Sma I sites of the GFP vector or the Kan (amp or tet) vector Frames 
1, 2, and 3 using a molar ratio of 1:10. 

EXAMPLE 15 
SUBTRACTIVE HYBRIDIZATION 
[00239] Subtractive hybridization was carried out at the mRNA level using 
the Dynabead system from Dynal Biotech as follows with some modifications. The 
Dynabead Oligo (dt) 25 (magnetic beads) were first conditioned in preparation for 
use. A volume of l ,ml of magnetic beads was transferred to the magnetic stand for 
30 minutes. After the time period, the supernatant was removed and the magnetic 
beads were resuspended in 40^1 Binding Buffer. In the next, step poly A+ RNA 
(normal tissue) was annealed to the magnetic beads. To do this, the 10/g poly A+ 
RNA (suspended in 40^1 DEPC water) and the beads (400^1 in Binding buffer) were 
mixed thoroughly and annealed by rotating the RNA and bead mixture on a roller 
for 3-5 min at room temp. The tube was then placed in the magnetic stand for 30 
seconds and the supernatant was removed. The poly A+ RNA/dt was washed with 
Wash Buffer A (80(^1) and then washed with Wash Buffer B (80£J). The poly A+ 
RNA/dt was then washed 2 times with Cold RT Buffer (1 ml) and the RT Buffer 
was removed. In the next step, solid phase synthesis of cDNA was carried out on 
the magnetic beads. 

[00240] A combination protocol was carried out using a rTth Reverse 
Transcriptase, a heat stable enzyme used at elevated temperatures to reduce 
secondary structure. After removal of the RT Buffer by the magnetic stand above, 
900/J of the RT Reaction Buffer 1 was added to the poly A+ RNA/dt containing 
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lmM MnCl 2 , dNTPs and rTth enzyme (225 units). The tube was transferred to the 
hybridization oven for 60 min at 37°C. After the incubation RT Reaction Buffer 1 
was removed by the magnetic stand (30 sec) and RT Reaction Buffer 2 was then 
added (900^1) containing rTth (50 units) and incubated in the hybridization oven for 
10 min at 70°C. The tube was transferred to the magnetic stand and the supernatant 
was removed. The beads (subtractor cDNA Dynabeads) were resuspended in 90(^1 
2 mM EDTA and heated at 95°C to melt away the poly A+ RNA. The supernatant 
was immediately removed by the magnetic stand. The subtractor cDNA Dynabeads 
were washed 2 times in 90(^<1 TE Buffer, pH 8.0 and then resuspended in 500/J TE 
Buffer, pH 8.0. Subtractive hybridization proceeded in the following manner. The 
subtractor cDNA Dynabeads in TE Buffer and J^g target poly A+ RNA (cancerous 
tissue) in hybridization buffer (200^) were heated simultaneously at 68 °C for 3 
minutes. The supernatant for the subtractor beads was removed and the 20(^1 
containing the target poly A+ mRNA was transferred to the tube. The tube was 
wrapped with parafilm and incubated at 65-68°C for 20-24 hours under constant 
rolling. After the hybridization, the tube was transferred directly to ice. The 
subtractor Dynabeads with the mRNA/cDNA hybrids were collected by the 
magnetic stand on ice. The hybridization solution containing the target specific 
mRNA was transferred to a new tube on ice awaiting a second hybridization. The 
subtractor Dynabeads were regenerated by adding 2(^J water and heating for 3 
minutes at 68°C to elute mRNA. The tube containing the regenerating Dynabeads 
was placed in the magnetic stand and the supernatant was removed. The 
regenerated Dynabeads were resuspended in 10(^J TE solution, ready for a further 
round of hybridization. The hybridization was carried out 2 more times. After the 
final round, the target mRNA for the hybridization solution was isolated. To do 
this, 20 a1 of new Dynabeads OUgo(dt) 25 were pre-washed in hybridization buffer. 
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The 20(^1 hybridization buffer containing the target specific mRNA was then added 
to the washed new Dynabeads. The mixture was incubated for 5-10 minutes with 
constant rolling at room temperature. Using the magnetic stand, the target specific 
mRNA/Dynabeads complex was washed once with 200 1 Buffer A and 2 times with 
20(^1 Buffer B. After removal of Buffer B, the target specific mRNA was eluted 
with 20^1 lOmM Tris pH 7.5 by incubating at 65°C for 2 minutes. The "target 
specific RNA" was then used for cDNA synthesis in the method above. 
Alternatively, subtraction was also accomplished by using the PCR-Select cDNA 
Subtraction Kit (Clontech). 



EXAMPLE 16 

SCREENING FOR cDNAs WITH OPEN READING FRAMES (ORFS) 
[00241] The target diversity for the cDNA RAPID LIB library was 1 0 5 "open 
reading frame clones." To achieve this diversity, the cDNA produced by the 
methods described above was cloned into expression screening vectors containing 
either the green fluorescent protein (GFP) or the kanamycin resistance gene. 
Construction of expressing cDNA- gene III fusion proteins in pCANTABSE 
(phagemid) required the cDNA to be in frame with the gene III leader sequence at 
the 5' end and the gene III at the 3' end. To increase the number of in frame 
cDNAs, two screening vectors were designed. These vectors were the GFP and 
Kanamycin resistance screening vectors. The GFP vector was designed so that only 
an "in frame" ORF cDNA will produce green colonies. The out of frame and 
termination codon containing cDNAs generated white colonies. The high 
background (many colonies need to be plated) of the GFP screen led to the design of 
an antibiotic resistance screen using the Kanamycin resistance gene aminoglycoside 
phosphotransferase. Unlike the GFP screen, the Kanamycin screen only produced a 
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resistant colony if an "in-frame" ORF was present in the clone. Preferably, the two 
step expression screen is performed using kanamycin screening followed by GFP 
screening. This is preferred since only clones with open reading frames will grow 
on kanamycin media, thereby eliminating much of the initial non-expressing clones. 
By performing the kanamycin screen first reduces the number of plates in the GFP 
screen by 1/10*. 

EXAMPLE 17 

CONSTRUCTION OF GFP SCREEN FOR cDNAs WITH "IN FRAME ORFs" 
[00242] The screen selected cDNAs with open reading frames that were "in 
frame" with the green fluorescent protein (GFP). The GFP used was wild type, not 
a mammalian codon optimized sequence. In order to construct the vector, the GFP 
gene was obtained by cloning the enhanced green fluorescent protein (GFP) from 
Clontech GFP cloning vector pEGFP-Cl. The GFP ORF was trimmed to 249 
amino acids, a minimum size which still retained activity. PCR primers were 
designed to clone the GFP ORF into the Nde I and Kpn I sites of the pUC 19 
sequencing vector. Cloning the GFP ORF into pUC 19 replaced the lacZ gene. The 
promoter and first 19 amino acids MAPDAVFSPYASVRYFTPH (SEQ ID NO: 
192) from lac Z were retained in the vector. The PCR primers were designed to 
incorporate 2 stop codons at the 3 'end of GFP and to add a Not I site at the 5' end. 
For cloning, l/(g pUC 19 plasmid (New England Biolabs) was digested with 10 units 
each of Nde I and Kpn I in a total volume of 20^ and incubated at 37 °C for 1 hour. 
After the incubation, the DNA was treated with shrimp alkaline phosphatase for an 
additional hour, followed by chloroform extraction and ethanol precipitation. The 
GFP insert was amplified by PCR. For the reaction, 10 ng of Clontech GFP cloning 
vector pEGFP-Cl, 1^1 of 10 uM each of: 
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GFP forward cloning primer: 
5'-GCCATTCTTGGTACCGCGGCCGCCCCGGTCGCCACCATGGT 
GAGCAAG -3' (SEQ ID NO: 193) and 

GFP reverse cloning primer: 
5 ' -GGAATTCCTGC AT ATGCT ACT ATCG AG ATCTG AGTCCGGACT 
TG (SEQ ID NO: 194), 

[00243] 1 0 jal 5X ThermalAce buffer, 1^1 of SOX dNTPs, and j^l Thermal Ace 
Polymerase (Invitrogen) were combined in a total volume of 50^ol. PCR was 
performed in MJ Research TETRAD cycler (one cycle of 95 °C for 5 min, 30 cycles 
of 60 °C for 45 sec, 72 °C for 60 sec, 94 °C for 45 sec, and a final cycle of 72 ° C for 
5 min). The GFP PCR product was chloroform extracted and ethanol precipitated. 
To prepare the insert, 500 ng of the PCR product was digested with Nde I and Kpn I 
as described above and then chloroform extracted and ethanol precipitated. For the 
ligation, 100 ng of digested PCR product was ligated to 50 ng of digested pUC19 
overnight using T4 DNA ligase (New England Biolabs). The ligation was 
transformed into TOP 10 one shot competent cells (Invitrogen) and plated on amp 
plates. The insert was confirmed by PCR and sequencing. 

[00244] To test the vector and incorporate identical restriction sites, as in the 
phage display vector pCANTABSE, a "staffer fragment" consisting of the 705 base 
pair sequence of TNF R2 was cloned in at the 5' end of the GFP using PCR. The 
stuffer fragment only funceions to facilitate the introduction of cloning sites. There 
is no size limitation on the stuffer or replacement inserts. The TNF R2 sequence has 
been modified to have a Sfi I and a Sma I site at its 5' end. These sites are in frame 
with the leader sequence. It has also been modified with a Sma I and Not I site at its 
3' end. The sites were in frame with GFP. The complete translation product 
resulted in a leader sequence, TNF R2 and GFP. For the cloning, PCR was carried 
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out in a 5(^1 reaction containing 10 ng of TNFR2 extra cellular fragment (previously 
cloned by RT-PCR), 1/4 of 10 M each of: 

TNFR2 forward primer: 
5'-GATTTCCTTGCGGTACCGGCCCAGCCGGCCATGGCCCCC 
GGGATGGCGTTGCCCGCCC AGGTGGC-3 ' (SEQ ID NO: 195); and 

TNFR2 reverse primer: 
5'-GATTTTCGCGGCCGCGTCGCCAGTGCTCCCTTCAGC (SEQ ID NO: 196), 
[00245] 1 Old 5X ThermalAce buffer, \/A of SOX dNTPs, and \M TheimalAce 
polymerase (Invitrogen). PCR was performed in a MJ Research TETRAD cycler 
using the following conditions: one cycle of 95 °C for 5 min, 30 cycles of 60 °C for 
45sec, 72 °C for 60 sec, 94 °C for 45 sec, and a final cycle of 72 ° C for 5 min. The 
PCR product was chloroform extracted and ethanol precipitated. The pUC 19- GFP 
construct (j^g) was digested with Notl and Kpn I, 10 units each, for 1 hour and then 
dephosphorylated. The TNF R2 PCR product (500 ng) was also digested with Not I 
and Kpn I. Both vector and insert was chloroform extracted and ethanol 
precipitated. For the ligation, 100 ng of digested PCR product was combined with 
50 ng of digested pUC19-GFP overnight using T4 DNA ligase (New England 
Biolabs). After incubation, the DNA was transformed into TOP 10 one shot 
competent cells (Invitrogen) and plated on AMP plates. The insert was confirmed 
by PCR and sequencing. The resultant clone (GFP screening vector) developed 
green colonies as expected. The GFP screening vector cloning sites Sma I and Sfi I 
/ Not I have identical reading frames with the phage display vector (pCANTABSE) 
at the same restriction sites so that cDNA fragments that are "in frame" with the 
GFP protein will also be "in frame" with gene III in the phage display vector. The 
complete translation product from the GFP screening vector Lac promoter was the 
leader sequence MAPDAVFSPYASVRYFTPH (SEQ ID NO: 192), i.e., the first 19 
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amino acids of the GFP fusion protein, followed by the extracellular sequence of 
TNF R2 and the GFP. The TNF R2 sequence served as the "stuffer fragment" for 
the vector and was excised when the vector fragment was prepared for ligation to 
the cDNAs. To produce vector fragments for insertion into cDNAs for screening, 
the GFP screening vector was digested with Sma I or Sfi I and Not I which excises 
the fragment containing the TNF R2 gene. For the screen, the cDNA fragments 
were inserted into either the Sma I or Sfi I / Not I sites of the pUC 19-GFP vector in 
the ligation reaction. Colonies were identified by the presence of GFP in the screen. 
The cDNA was excised and ligated into the phagemid vector (pCANTASE) as 
previously described. The amino acid sequence of GFP is shown below: 

GRHHGEQGRGAVHRGGAHPGRAGRRRKRPQVQRVRRGRGRCHLR 

QADPEVHLHmQAARALAHPRDHPDLRRAVLQPLPRPHEAARLLQV 

RHARRLRPGAHHLLQGRRQLQDPRRGEVRGRHPGEPHRAEGHRLQ 

GGRQHPGAQAGVQLQQPQRLYHGRQAEERHQGELQDPPQHRGRQR 

AARRPLPAEHPHRRRPRAAARQPLPEHPVRPEQRPQREARSHGPAGV 

RDRRRDHSRHGRAVQVRTQISIV (SEQ ID NO: 197). 

EXAMPLE 18 

KANAMYCIN SCREENING FOR cDNAs WITH "IN FRAME ORFs" 
[00246] The GFP screen described above resulted in false positives and had a 
very high background. Replacing GFP with an antibiotic resistance gene would 
result in a much lower background and a reduction in the number of plates required 
to screen a library since only those colonies expressing the antibiotic resistance 
protein would grow on the plate. 

Construction of Kan Screening Vector Frame 1 
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[00247] To construct the Kan screening vector, the kan resistance gene, 
obtained by PCR from the pZERO-2 vector (Invitrogen), was inserted into the GFP 
screening vector, replacing the GFP gene. The PCR fragment containing the kan 
resistance ORF contained no start codon and 2 stop codons and the sequence was 
modified with a Not I site at the 5' end and an Nde I site at 3' end using PCR. The 
GFP screening vector was cut at the Not I and Nde I sites and ligated to PCR 
fragment containing the kan resistance gene at the same sites. The ligation replaces 
the GFP gene with the kan resistance gene. For the cloning, the Kanamycin 
resistance gene was obtained from vector pZERO-2 using PCR. For the PCR 
reaction, 10 ng of pZERO-2 vector (Invitrogen), 1/J of 10#M each of: 

Kan Forward primer: 

5 ' -GC ATT AATCTATAGCGGCCGCC ATTGAACAAGATGGATTG 

CACGCAGG-3' (SEQ ID NO: 198); and 
Kan reverse primer: 

5 ' -GC ATT ACTATCCC ATATGTC ATC AGAAGAACTCGTC AAGA 

AGGCG-3' (SEQ ID NO: 199), 
10/J 5X ThermalAce buffer, 1^1 of 50X dNTPs, and 1/J ThermalAce polymerase 
(Invitrogen) were combined in a total volume of 50^. PCR performed in MJ 
Research TETRAD cycler under the following conditions: one cycle of 95 °C for 5 
min, 30 cycles of 60 °C for 45sec ,72 °C for 60 sec, 94 °C for 45 sec, and a final 
cycle of 72 ° C for 5 min). Both the kanamycin resistance PCR product and the GFP 
pUC 19-GFP screening vector were digested with Not I and Nde I as above. Both 
vector and insert were chloroform extracted and ethanol precipitated. In the ligation 
reaction, 100 ng of digested PCR product and 50 ng of digested GFP- pUC19 were 
combined and incubated overnight at 15°C using T4 DNA ligase (New England 
Biolabs). After incubation, the DNA was transformed into TOP 10 one shot 
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competent cells (Invitrogen) and plated on kanamycin (2^tg/ml) plates. Colonies 
were both amp and kan resistant. The insert was confirmed by PCR and sequencing. 
The complete translation product from the kan screening vector Lac promoter was 
the leader sequence MAPDAVFSPYASVRYFTPH (SEQ ID NO: 192; i.e., the first 
19 amino acids of the GFP fusion protein) followed by the extracellular sequence of 
TNF R2 and kan resistance protein. To produce vector fragment for insertion of 
cDNAs for screening, the vector was digested with Sma I or Sfi I and Not I which 
excised the fragment containing the TNF R2 gene. 

[00248] Screening involved inserting the cDNA fragments into either the Sma 
I or Sfi I / Not I sites of the vector in the ligation. Colonies were identified for kan 
resistance in the screen as described above. The amino acid sequence of the kan 
resistance gene is as follows: 

Sequence of KAN ORF: 

IEQDGLHAGSPAAWVERLFGYDWAQQTIGCSDAAVFRLSAQGRPVL 

FVKTDLSGALNELQDEAARLSWLATTGVPCAAVLDVVTEAGRDWL 

LLGEVPGQDLLSSHLAPAEKVSIMADAMRRLHTLDPATCPFDHQAK 

HRIERARTRMEAGLVDQDDLDEEHQGLAPA£LFARLKASMPDGEDL 

VVTHGDACLPNIMVENGRFSGFIDCGRLGVADRYQDIALATRDIAEE 

LGGEWADRFLVLYGIAAPDSQRIAFYRLLDEFF (SEQ ID NO: 200) 

The following are the primers used to sequence cDNA fragments in the screen. 
Puc 19 reverse primer: 

5 ' -GCTTCCGGCTCGTATGTTGTGTGGAATTG-3 * (SEQ ID NO: 

201); 
Kan reverse primer: 
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5'-CAGTCATAGCCGAATAGCCTCTCCACC-3' (SEQ ID NO: 
202). 



Construction Of Kan Screening Vectors Frames 2 And 3 

[00249] The Frame 2 and Frame 3 Kan screening vectors derived from the 
Frame 1 Kan vector (above) had three modifications differentiating them from the 
Frame 1 vector. In the first modification, Frame 2 had one base inserted before the 
5' Sma I cloning site and Frame 3 had two bases before the 5' Sma I cloning site. 
The promoter was shifted out of frame with the kan resistance gene so in the 
screening vector no TNF R2 -kan resistance protein fusion was produced as in 
Frame 1 above, and therefore there was no kanamycin resistance. Resistance was 
only restored when a cDNA of either reading frame 2 or 3 was inserted. The null 
background of relegated vectors digested with Sma I was also zero, unlike the 
Frame 1 vector above which was kan resistant and lead to significant background. 
The second and third modifications were the addition of a unique spacer and a 5' 
Myc tag. Any expression screening vector may be constructed to have a tag that 
differentiates it from target peptide binders. The tag includes, but is not limited to, 
5' Myc epitope tag, FLAG, HA, and any unique sequence that facilitates 
sequencing. The sequence of the Myc tag is EQKLISEEDL (SEQ ID NO: 203). 
The Myc tag was located 3' to the Sfi site. The linker sequence, consisting of 
PAGGAMAA (SEQ ID NO: 204), was inserted between the Sfl I site and the Myc 
tag. The 5' Myc tag was more likely to be visible to antibodies than a buried 3' E- 
tag. Also, the addition of the spacer and Myc tag in Frames 2 and Frame 3 enabled 
differential sequencing at the phagemid level (pCANTAB5E) of cDNA fragments in 
Frame 2 and 3 compared to "peptide" inserts in the same vector (pCANTABSE). 
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The two different forward sequencing primers for the differential sequencing are 
shown below. 

PAMA forward "peptide" sequence forward primer: 

GCC CAG CCG GCC ATG GCC (SEQ ID NO: 205) 

Myc forward 1 (Version A for Frame 2 and 3): 

GGC CGC CGA ACA GAA ACT GAT TAG C (SEQ ID NO: 206) 

Myc forward 2 (Version B for Frame 2 and 3): 

GAT GGC CGC CGA ACA GAA ACT G (SEQ ID NO: 207) 

[00250] For the cloning, the PCR was carried out in a total volume of 50A 
containing 10 ng of pUC 19- GFP screening vector with stuffer, j^l of 10^M each of: 

Myc frame 2 forward primer: 

CCGAACAGAAACTGATTAGCGAAGAAGATCTGCCCGGGTTGC 

CCGCCCAGGTGGC (SEQ ID NO: 208), or 
Myc frame 3 forward primer: 

CCGAACAGAAACTGATTAGCGAAGAAGATCTGCCCGGGTTTG 

CCCGCCCAGGT (SEQ ID NO: 209), and 
TNFR2 reverse primer 

GATTTTCGCGGCCGCGTCGCCAGTGCTCCCTTCAGC (SEQ ID 

NO: 210), 

1(^1 5X ThermalAce buffer, 1^ of SOX dNTPs, and 1^1 ThermalAce polymerase 
(Invitrogen). PCR performed in a MJ Research TETRAD cycler (one cycle of 95 °C 
for 5 min, 30 cycles of 60 °C for 45 sec, 72 °C for 60 sec, 94 °C for 45 sec, and a 
final cycle of 72 °C for 5 min). Each of the PCR reactions were digested with Sfi I 
and Not I. Also kanamycin screening vector Frame 1 (l#g) was digested with Sfi I 
and Not I. For the ligation, 100 ng of digested PCR product was combined with 50 
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ng of digested kanamycin screening vector and incubated overnight using T4 DNA 
ligase (New England Biolabs). After the incubation, the DNA was transformed into 
TOP 10 one shot competent cells (Invitrogen) and plated on kanamycin (2^&g/ml) 
plates. Colonies were both AMP and Kan resistant. The Insert was confirmed by 
PCR and sequencing. The final vectors were Frame 2 Myc Kan pUC19 (Kan 
screening vector Frame2) and Frame 3 Myc Kan pUC19 (kan screening vector 
Frame 3). 

EXAMPLE 19 

CLONING EXPRESSION-SCREENED cDNAs INTO PHAGE DISPLAY 

VECTOR pCANTABSE 
[00251] For each cDNA fragment which corresponded to one screen, 1 0 5 
clones were plated corresponding to the known diversity of the human genome. In 
the case of the green fluorescent protein (GFP) screen, the ligations were 
electroporated into TOP 10 cells followed by growth for 1 hour in 2XYT-G at 37°C. 
Colonies were plated on TC plates (2xYT-A +IPTG, 245 X 245 X 25mm) so that 
single colonies were well separated and the plates were incubated at 37°C overnight. 
Individual green colonies were selected and analyzed by PCR and sequencing. PCR 
analysis showed insert positive clones which contained ORFs (as analyzed by 
sequencing analysis). For the purpose of amplification, individual colonies were 
grown as a pool or individually in 2xYT-AG until the final amount of ampicillin 
resistant (amp^cells was approximately 2-5 X 10 u cells. 

[00252] In the case of the kanamycin screen, the ligations were electroporated 
into TOP 10 cells followed by growth for 1 hour in 2XYT-G at 37°C. Colonies 
were plated on TC plates (2xYT-K +IPTG, 245 X 245 X 25mm) so that single 
colonies were well separated and the plates were incubated at 37°C overnight. The 
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colonies were scraped into PBS and 2xYT-KG media was inoculated for 
amplification until the amount of kanamycin (kan 1 ) cells was approximately 2-5 X 
10 n . After amplification, cells were pelleted and plasmid preparations were carried 
out. In general, 2-5 XI 0 11 cells yielded 200-40^ of DNA quantitated by optical 
density. Multiple screens were carried out until the targeted diversity was achieved 
for the library. In order to excise the ORF inserts from the purified plasmid 
preparations, I7(fag of plasmid was digested in a reaction containing Not I l^ul (50 
units/y), 1 X NEB Buffer 3, and IX BSA in a total volume of 80£fcJ. The tube was 
incubated for 2 hours at 37 °C. After the incubation, the DNA was phenol- 
chloroform extracted and precipitated with isopropanol/sodium acetate. The DNA 
pellet was resuspended in 100^1 H 2 0. The DNA was then digested with in a reaction 
containing Sfi I 30 1 (20 units^l), 1 X NEB Buffer 2, and IX BSA in a total volume 
of 800^. The tube was incubated for 2 hours at 50 °C. After incubating, the 
digestion was run on a 1% agarose gel to visualize the DNA "smear". The gel 
portion which contained the DNA smear was cut out purified. The DNA was 
purified using QIAquick (Qiagen) and quantitated by optical density. The purified 
DNA corresponding to ORFs was ligated to the pCANTAB5E (phage display 
vector) amp or tet using a molar ratio of 1 :5. Production of cell and phage display 
libraries was carried out as described below. 

EXAMPLE 20 
CONSTRUCTION OF CELL LIBRARIES 
[00253] To prepare electrocompetent cells, an overnight culture of E.coli TGI 
cells (F'traD361acI q A (lacZ)Ml 5proA+B+/supEfl[hsdM-mcrB)5r k -m k -McrB-thi4(lac- 
proAB) was diluted to an optical density of O.D. 600 = 0.05-0.1 (1:100) in 500 ml 
2xYT, then grown at 37° C in 4 liter Erlenmeyer flasks to an O.D. 600 = 0.5 - 0.6. 
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The culture was poured into pre-chilled centrifuge bottles and incubated on ice for 
30 minutes prior to centrifugation at 2000 x g for 10 minutes (2°-4° C). The 
supernatant was poured off, and the cell pellet was resuspended in a total of 400 ml 
of ice-cold sterile distilled water. The process of centrifugation and resuspension 
was repeated two more times. After the last centrifugation, the pellet was 
resuspended in a total of 25 ml of ice-cold sterile water containing 10% glycerol. 
The cell suspension was transferred to pre-chilled 35 ml centrifuge bottles, and was 
then pelleted at 2000 x g for 10 minutes at 2°-4° C. The cells were then suspended 
in 0.3 ml of ice-cold sterile 10% glycerol solution, aliquoted, and snap-frozen. The 
aliquots were stored at -80° C. 

[00254] The ligation product was purified using QIAquick spin columns 
(Qiagen) with 3 consequent washes and eluted with sterile deionized water. 
Electrotransformations were performed at 1500 Volts in electroporation cuvettes 
(O.Unm gap; 0.1 ml volume) containing 100-300 ng of DNA and 40-80 L of TGI 
electrocompetent cells. Immediately after the pulse, 1-2 ml of pre- warmed (40° C) 
2xYT medium containing 2% glucose (2xYT-G) was added and the transformants 
were grown at 37° C for 1 hour at 250 rpm. Cell transformants were pooled, the 
volume measured, and an aliquot was plated onto 2xYT-G containing 109#g/ml 
ampicillin (2xYT-GA) or 5(^g/ml tetracycline (2xYT-GT) plates to determine the 
total number of transformants and to perform PCR and sequence analysis of 
randomly selected clones. 

[00255] To amplify the library, the transformants were plated onto 2xYT-GA 
or 2xYT-GT TC dishes (245 x 245 x 25 mm) in order to obtain individual colonies 
(2-8 x 10 3 cfu/plate) and grown for 16-20 hours at 30° C. The cells were collected 
by scraping colonies into sterile PBS for a final concentration about 1 x 10 10 cfu/ml. 
Sterile glycerol was added to a final concentration of 10%; the library was aliquoted 
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and stored at -80° C. Exact concentration of transformants was determined by 
plating an aliquot onto 2xYT-GA or 2xYT-GT plates. 

Phage Rescue of cDNALIBs 

[00256] Phage rescue of the library was carried out using the standard phage 
preparation protocol with the following changes. At least 10 8 cfu/ml was inoculated 
with 2xYT-GA or 2xYT-GT media. Cells were grown to O.D.^0.6-0.8 at 37° C 
with shaking (250 rpm). Helper phage (M13K07) was then added (MOI=15), and 
the cells were incubated for 30 minutes at 37° C with shaking (250 rpm), followed 
by 30 minutes at 37° C without shaking. Following infection, cells were pelleted 
(2000 x g, 15 min) and the supernatant containing the helper phage was discarded. 
The cell pellet was resuspended in 3 times initial culture volume of 2xYT-A or 
2xYT-T media for recombinant phage amplification (5 hours, 37° C, 250 rpm). The 
solution of phage in medium was then separated from cells by centrifugation (3000 
x g, 30 min, 4° C), passed through a 0.4^m filter, aliquoted and stored at -80° C. 
The phage was titered by infecting TGI competent cells and plating onto 2xYT-GA 
or 2xYT-GT media. The phage titer was 10 9 -10 n cfu/ml. 

Construction of Breast Tumor cDNALIB Libraries 

[00257] The poly A+ RNA ( l£g) isolated from breast tumor tissue was used to 
produce cDNA. To carry out the first strand synthesis, the RNA, the 3' SMART 
CDS Primer IIA (1^1) and the modified 5' Smart Primer (l^M) were added 
together for a final volume of ^1. The tube was incubated at 72 °C for 2 minutes to 
denature the RNA and primer. The tube was placed on ice for 2 minutes and then 
centrifuged at 14,000 rpm for 20 sees to collect the contents. To the 5^1 final 
volume, fy) (5 X First Strand Buffer), y\ DTT (20 raM), 1^1 dNTP (10 mM), andj^l 
Power Script Reverse Transcriptase were added to carry out the first strand reaction. 
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The contents of the tube were mixed gently and incubated at 42°C for 1 hr and then 
placed on ice to terminate the reaction. To carry out the PCR amplification for 
producing double stranded cDNA,^J of the subtracted mRNA first strand reaction 
or 0.4^ un-subtracted mRNA first strand reaction was used in the PCR reaction 
containing lC^KlOX Advantage 2 Buffer ), 2^1 (SOX dNTPs),^J modified Smart 
Primer, AjX 5' PCR primer, and 2jj\ Advantage 2 Enzyme in a total volume of lOp^l. 
For the PCR amplification rounds, one preliminary cycle (95°C, 1 min) and then 24 
cycles were carried out (95°C for 5 sec, 65°C for 15 sec, 68°C for 6 min). The 
cDNA that resulted was column purified using QIAquick™ columns (Qiagen) and 
quantitated by Picogreen Method (Molecular Probes; Eugene, OR). In order to 
obtain preferred size fragments ranging from lOObp to 3000bp, more preferably 
from 200bp to 2000bp for insertion into the phage display vector, the initial long 
cDNA fragments were digested with multiple enzymes with known infrequent 
cutting profiles in the mammalian genome. The digestions were conducted in the 
following manner. For each digestion, 5^ of each enzyme, 10 X Buffer (15^1), and 
100X BSA were added to 5#g of the double stranded cDNA PCR product. The total 
volume for each reaction was 15(^1. The digestions were as follows: 

Digestion Reaction 1 containing enzymes Nar I, Eco Rl, and Avr II and NEB Buffer 
1 was carried out for 4 hours at 37 °C. 

Digestion Reaction 2 containing enzymes EcoRl, Hind III, and Xho I and NEB 
Buffer 2 was carried out overnight at 37°C. 

Digestion Reaction 3 containing enzymes Bam HI, Xba I, and Avr II and NEB 
Buffer 2 was carried out for 4 hours at 37°C. 

Digestion Reaction 4 containing enzymes Bgl II, EcoRl Xho I and NEB Buffer 3 

was carried out for 4 hours at 37°C. 

[00258] 
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[00259] The DNA in each digestion reaction was chloroform extracted and 
precipitated with isopropanol and the 5' and 3' overhangs were filled in by the Pfu 
DNA Polymerase polishing reaction. The precipitated DNA was resuspended in 20 
* 1 water and the polishing reaction was carried out in a total volume of lOtyul 
containing lQ>ul (10 X Buffer) 10>ul (lOmM dNTPs) and Itycd Pfu DNA Polymerase 
(2.5 units^J). The reaction was incubated at 72 °C for 30 minutes after which the 
cDNA was chloroform extracted, precipitated and resuspended in 5(^1 water. DNA 
quantitation was performed by the Picogreen method or by optical density. The 
cDNA was ligated into the Sfi-Sma site of pCANTABSE phage display vector using 
a molar ratio of 1 : 10 and 1 :20. The cell library was produced as described above 
and colony PCR was carried out on individual colonies. 

[00260] Colony PCR involved using the PCR primers which flanked the 
inserted cDNA fragment producing the multi-size fragments indicating a good range 
of DNA lengths (300-1000bp) as was desired. Also the ligation efficiency was close 
to 100% with a few PCRs indicating a vector re-ligated fragment. However, in 
order to produce a library of ORFs, approximately 5 X 10 11 cells from the cell 
library were pelleted and a plasmid preparation was produced (Qiagen). The DNA 
was ligated into the GFP expression vector and the expression screen was carried 
out as described above. Following the expression screen, the cell and phage 
libraries (both amp and tet) in the pCANTABSE phagmid vector were produced as 
previously described above. PCR analysis of the cDNA inserts in the phage and cell 
library show the efficiency of ligation to be 100% among the 36 clones (cell library) 
and 34 clones (phage library) examined. Furthermore the inserts ranged in size from 
400bp to lOOObp as expected. 
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EXAMPLE 21 
CONSTRUCTION OF INDIVIDUAL TARGET(S) 

Construction of DGI 2 -Gene III Fusion Proteins in pCANTABSE (tet) 
[00261] DGI-2 was identified as an uncharacterized gene in a public database 
designated KIAA0101 (GenBank Acc. No. XM_042258). The DNA sequence that 
encodes DGI-2 is shown in Figure 3. 

[00262] An IMAGE clone containing the DGI-2 sequence (IMAGE: 
3838690; accession number BE748524) was obtained from the American Type 
Culture Collection (ATCC, Manassas VA). The clone included 770 bp and 
encompassed the entire DGI-2 open reading frame (ORF). 

[00263] The 400 base pair DNA sequence encoding the target protein DGI-2 
protein (111 amino acid residues), was cloned into the Sfi I and Not I restriction 
sites in the pCANTABSE (tet) vector. The resulting fusion protein consisted of the 
DGI-2 protein fused to the C-terminus of the genelll signal sequence and to the N- 
terminus of the gene III protein. For ligation of the sequence encoding the target 
protein into the vector, primers were designed to add the appropriate restriction sites 
and amino acids for signal peptide cleavage at the 5' and 3' end of the sequences. 
The primer sequences were as follows: 

Forward ligation primer :5' TgT TCC TTT CTA TgC ggC CCA gCC ggC 
CAT ggC gAT ggT gCg gAC TAA AgC AgA 3' (SEQ ID NO: 21 1) 

Reverse ligation primer : 5' CgA AAT CTT TTg gAC TCA CAC TgC ggC 
CgC TTC TTT TTC ATC ATT TgT gTg ATC 3' (SEQ ID NO: 212). 
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[00264] The primers were used in the PCR reaction with a high fidelity DNA 
polymerase (ThermalAce, Invitrogen ) using the IMAGE clone containing the DGI- 
2 sequence (IMAGE: 3838690; Accession Number BE748524) as the template to 
generate the insert for ligation. The pCANTABSE vector plasmid was digested with 
Sfi I and Not I and the vector fragment was gel purified. The vector (100 ng), insert 
(50-100 ng), and 0.5 \il of ligase (BM) were mixed in a 10 \xl ligation reaction and 
incubated over night at 15°C. The ligation product was purified and electroporations 
were performed with Escherichia coli strain TGI electrocompetent cells ((F' traD36 
lacl q A(lacZ)M15 proAB) / supE A(hsdM-mcrB) 5 r k "m k - McrB ) thi A(lac-proAB). 
Cell transformants were pooled and an aliquot was plated on plates containing 10 
|Lxg/ml tetracycline. The correct constructs were determined by colony PCR 
followed by sequence analysis using primers for the pCANATABSE vector. 

Construction of Hras - GENE III Fusion Proteins in pCANTABSE (tef) 
[00265] Hras was identified from GenBank accession number NM_005343 - 
Homo sapiens v-Ha-ras Harvey rat sarcoma viral oncogene homolog (Hras) mRNA. 
The Genbank sequence was used to design cloning primers. The primers extended 
into the 5' and 3' untranslated regions and generated the 570 base pair open reading 
frame. Cloned from Human Universal QUICK-Clone™ cDNA from Clontech using 
ThermalAce high fidelity polymerase from Invitrogen. Sequence confirmed by 
sequencing after sub cloning into pZERO cloning vector from Invitrogen. 

Forward cloning primer: 

5 '-CCCTGAGGAGCGATGACGGAATATAAGCTG-3 ' (SEQ ID NO: 213) 
Reverse cloning primer: 

5'-GTCCCCCTCACCTGCGTCAGGAGAGCACAC-3' (SEQ ID NO: 214). 
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[00266] The Hras sequence was not activated. Activated Hras was created by 
substituting the base G to T at position 29 as demonstrated in oncogene T24. The 
base substitution was performed using a mutagenesis primer, the above reverse 
cloning primer, and PCR using ThermalAce high fidelity polymerase from 
Invitrogen. 

Activation forward primer 

5 ' - ATGACGGAATATAAGCTGGTGGTGGTGGTCGCCG-3 ' (SEQ ID NO: 
215). 

Construction of Leptin ligand - GENE HI Fusion Proteins in pCANTAB5E (teO 
[00267] Leptin ligand was identified from GenBank accession number 
NM_000230 - Homo sapiens leptin (obesity homolog, mouse) (LEP) mRNA. The 
Genbank sequence was used to design cloning primers. 

[00268] The primers extended into the 5 ' and 3 * untranslated regions and 
generated the 501 base pair open reading frame. Cloned from Human Universal 
QUICK-Clone™ cDNA from Clontech using ThermalAce high fidelity polymerase 
from Invitrogen. The sequence was confirmed by sequencing after subcloning into 
the pZERO cloning vector from Invitrogen. 

[00269] Forward cloning oligo 

5 ' -CC ATCCTGGGAAGGAAAATGC ATTG-3 ' (SEQ ID NO: 216) 
Reverse cloning oligo 

5 '-AGGGAAATTGAC AGAGTCCTGGATAAGGGG-3 ' (SEQ ID NO: 217). 
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[00270] Both Hras and Leptin genes were cloned into Sfi I and Not I sites of 
the pCANTABSE (tet) vector using PCR. The following forward primers added the 
Sfi I site and required signal sequence to activate Hras and Leptin ligand. 

Hras forward primer 

5 ' -CTT AAAGCTTGGCCC AGCCGGCC ATGGCGGAATATAAGCT 
GGTGGTGG-3' (SEQ ID NO: 218) 
Leptin Ligand forward primer 

5'-GCGGCCCAGCCGGCCATGGCGCATTGGGGAACCCTGTGCG 
GATTC-3' (SEQ ID NO: 219) 

The reverse primers added the required Not I cloning site. 
Hras reverse primer 

5 ' -GC AATTCTCTGAGCTCGCGGCCGCGGAGAGC AC AC ACTTG 
C AGCTC ATG-3 ' (SEQ ID NO: 220) 
Leptin Ligand reverse primer 

5'-GATTCCTTGCGGCCGCGCACCCAGGGCTGAGGTCCAGCT 
GC-3' (SEQ ID NO: 221) 

PCR products were cut with Sfi I and Not I (NEB) and ligated into pCANTABSE 
cut with Sfi I and Not I. Fusion proteins with Hras or Leptin Ligand fused to the C- 
terminus of the gene III signal sequence and to the N-terminus of the gene III 
protein resulted. 

EXAMPLE 22 

DRUG DISCOVERY USING TARGET: PARTNER PAIRS FOR HIGH- 
THROUGHPUT SCREENING 
[00271] Proteome Panning may be directly used for drug discovery. Listed 
below are non-limiting examples of methods that can be used to develop assays for 



WO 03/035839 



PCT/US02/34021 



- 116- 

high-throughput screening. The basic idea in all of the assays is to identify 
compounds from combinatorial libraries that inhibit target: target binder interaction. 

ELISA Method I 

[00272] In this approach, the target phage and partner phage express separate 
distinguishing tags including, but not limited to, myc, FLAG, etag, hexahistidine, 
etc. Initially, the target and partner phage are separated as described previously. 
Briefly, target phage is captured on plates previously coated with the Tag-specific 
antibody overnight at 4°C. The partner phage is added followed by compounds 
from a small molecule combinatorial library at a final concentration of 2^M. 
Following a 1 hour incubation at room temperature, the plates are washed and 
probed with an antibody specific for the partner phage specific-Tag and labeled with 
a reporter including, but not limited to horse radish peroxidase, fluorescin, 
luciferase, etc. After a second washing step, the chromogen is added and the plates 
read in the appropriate instrument 

ELISA Method II 

[00273] In this approach, the target is first synthesized by in vitro translation 
and then screened as described above in ELISA Method I. 

Cloning and in vitro expression of Target polypeptide : 

[00274] The DNA coding sequence for the target is directly cloned into the 
pIVEX 2.3d circular expression vector (Roche) from the pCantab vectors using the 
following primers 

Forward primer 
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5 , CTT TAA GAA GGA GAT ATA CCA TGG ccc AGC CGG CCA 

TGGC-3' (SEQIDNO:225) 
Reverse primer: 

5'- TGA TGA TGA GAA CCC CCC CCC GGA TAC GGC ACC GGC 

GCACC-3' (SEQ ID NO: 226) 
[00275] for use in the Rapid Translation System RTS 500 (RTS 500 E. coli 
HY kit, Roche Diagnostics GmbH). In each case, the coding sequence is inserted 
between the 5' Nde I and 3' Xho I sites, in frame with the his-tag provided in the 
vector. Expression reactions are performed by pipetting 10 ml Feeding Solution and 
1 ml Reaction Solution into their respective compartments in the reaction device 
(provided with the kit). Following this, 10 ng of plasmid DNA is added to each 
reaction mixture and the reaction is allowed to continue for 24 h at 30°C in the RTS 
500 Instrument prior to harvesting the expressed protein. 

Solubilizing, refolding and purification of insoluble proteins : 
[00276] Analysis of protein products from the in vitro system indicates that 
adequate expression levels may be obtained. However, high expression levels cause 
segregation of the protein products to the insoluble fraction of the expression 
reaction. The expressed proteins are thereby recovered as insoluble pellets, are 
denatured and subjected to a standard refolding regimen prior to use in panning 
experiments or other assays. After overnight expression at 4°C in the Roche RTS in 
vitro expression system, the reaction mixtures are removed from their disposable 
reaction vessels. The insoluble fractions are pelleted by a 10 min centrifiigation at 
maximum speed in a micro fuge. The pellet from each reaction mixture is dissolved 
by gentle vortexing in 1.0 ml Denaturation Buffer (50 mM TRIS, 8 M urea, 20 mM 
^-mercaptoethanol). Another 10 min spin in the microfuge removes insoluble 
material. Bradley assay (Bio-Rad Protein Assay) determines the protein 
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concentration of the soluble fraction. For refolding, aliquots of solubilized proteins 
are diluted to a protein concentration of 25 fig/ml in Denaturation Buffer and the 
diluted mixtures are loaded into 10,000 molecular weight cut off dialysis cassettes 
(Slide- A-Lyzer®, Pierce). Each 10 ml dilution mixture is dialyzed against three 
changes of 4L Dialysis/Refold Buffer (50 mM ammonium bicarbonate, 100 mM 
NaCl, pH 9.0) at 4°C. The first two changes are performed for a minimum of 2 h, 
each. The final dialysis is carried out overnight. The his-tagged proteins are 
purified by loading them onto a 1.6 x 1.2 cm Ni-Superflo column (2.4 ml, 
QIAGEN) equilibrated with Equilibration Buffer (50 mM NaH 2 P0 4 , 300 mM NaCl, 
10 mM imidazole, pH 8.0) followed by elution with Equilibration Buffer containing 
500 mM imidazole. 

[00277] Target is captured on Ni"*" 1 " coated plates overnight at 4°C. The 
partner phage is added followed by compounds from a small molecule 
combinatorial library at a final concentration of 2<)4iM. Following a one hour 
incubation at room temperature, the plates are washed and probed with an antibody 
specific for the partner phage specific-Tag and labeled with a reporter including but 
not limited to horse radish peroxidase, fluorescin, luciferase, etc. After a second 
washing step, the chromogen is added and the plates read in the appropriate 
instrument 

Microarravs 

[00278] In this approach, microarrays consisting of 1 0,000-30,000 chemical 
compounds from a combinatorial library are arrayed on microscope slides using an 
instrument such as the OmniGrid® Microarray (Gene Machines). The slides can be 
uncoated or coated with various chemicals such as bovine serum albumin, glycerin, 
etc. and the small molecules can be plated directly onto the solid phase, e.g. solid 
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support, matrix or slides, or captured with linkers. The target and partner phage 
with distinguishing tags (e.g., myc, FLAG, etag, etc.) are added together in an 
incubation chamber. After a short incubation period, the phage are washed off and 
binding is probed with anti-target antibody conjugated to the fluorescent dye Cy3 
and an anti-partner antibody conjugated to the fluorescent dye Cy5. Slides are read 
in any Microarray reader such as the GenePix 4100A or 4000B (Axon Instruments, 
Inc.). If the combi-chem compound binds to the domain on the target recognized by 
the partner (i.e., hotspot), only the Cy3 signal will be visualized. If the small 
molecule binds outside the hotspot, both antibodies can bind and a combined 
Cy3/Cy5 can be read. Conversely, compounds binding to the partner can also be 
detected in the same assay by the Cy5 signal. 

High-throughput ALPHASCREEN™ assay 
Compound library : 

[00279] A library of compounds synthesized by combinatorial chemistry 
techniques may be obtained from a supplier of in-house designed and synthesized 
novel drug- like organic molecules for high throughput screening. 

Assay development: 

[00280] Bead-based technology established by the principle of luminescent 
oxygen channeling (Ullman et al., 1996, Clin. Chem. 42:1518-1526; Ullman et al:, 
1994, Proc. Natl Acad. Sci. USA 91:5426-5430) is commercially available (e.g., 
ALPHAScreen; PerkinElmer). This assay format offers the advantages of being 
homogeneous, fluorescence-based, and easy to miniaturize or reduce for robotics. 
In addition, the format does not suffer from the distance limitations of TR-FRET 
(time-resolved fluorescence resonance energy transfer) assays. The detection limit 
of ALPHAScreen™ assays is 200 nm, whereas the detection limit of TR-FRET 
assays is 9 nm. 
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lo 0 2 8 1 ] A mixture of target ( K^l) (6xHis-tagged) and binder (myc-tagged) 
phage expressing target and partner respectively are added to 384-well low volume 
polystyrene microplates (e.g., ProxiPlate™-384; PerkinElmer Life Sciences, Boston, 
MA). Nickel-conjugated acceptor beads and anti-myc conjugated donor beads are 
added at a final concentration of 20/g/ml. The final assay volume is 2^J. Plates are 
then incubated for 60 min in the dark at room temperature. At the end of the 
incubation period, the fluorescence signal at 520 nm is read on a plate reader (e.g., 
Fusion-a HT; PerkinElmer). The values are expressed as AlphaScreen™ activity in 
counts per second (cps) and plotted against Log (Target phage) (CFU/ml). Assay 
optimization may include variation of phage concentrations, and variation of 
detector reagents (acceptor and donor bead concentrations), order of reagent 
addition and time of incubation. 

High-throughput methods : 

[00282] Relative potencies of combinatorial chemistry compounds as 
compared to binder phage are analyzed in a competition system. Detection of the 
target binder interaction will be measured in an amplified luminescent proximity 
homogeneous assay (ALPHAScreen™; PerkinElmer). The assay may be performed 
in 384-well low volume polystyrene microplates (e.g., ProxiPlate™-384; 
PerkinElmer Life Sciences) with a final volume of 20 pi. Final incubation 
conditions are combi-chem compounds at 10~ 5 M, 0.025 M HEPES (pH 7.4 at 
25°C), 0.100 M NaCl, 0.1 % BSA (Cohn Fraction V; Sigma Chemical Co., St. 
Louis, MO), 5-20 |ag/ml nickel-conjugated acceptor beads, and 5-20 fag/ml antimyc- 
conjugated donor beads. 

[00283] First, target and binder phage, and combi-chem compound are 
incubated for 2 h at room temperature. Next, acceptor and donor beads are added 
and the incubation is continued for an additional 1 h. At the end of the incubation 
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period, the fluorescence signal at 520 nm is read on a plate reader (e.g., Fusion-a 
HT; Packard Bioscience Co.). Primary data are analyzed, e.g., by importing into 
ActivityBase™ (ID Business Solutions Ltd, Guildford, UK), background corrected, 
normalized to buffer controls and then expressed as percent specific binding. Data 
is be validated if the Z'-factor (Zhang et aL, 1999, J. Biomol Screen, 4:67-73) for 
this assay is expected to be greater than 0.7 (Z f = l-(3cr + +3a )/||u + -|a |) and the signal- 
to-background (S/B) ratio is expected to be between 30 and 300. 

IC« n analysis of "HITS" : 

[00284] HITS are evaluated in a competitive binding assay to determine their 
ability to disrupt target:binder interaction in a dose-dependent manner. This allows 
an accurate estimation of HIT potency, IC 50 , relative to phage concentration. The 
data are fit to a four-parameter non-linear regression analysis (y = min + (max- 
min)/(l+10 A ((logIC 50 -x)*Hillslope)) ), and are used to determine IC 50 values. The 
Z'-factor (Zhang et aL, 1999, 7. Biomol Screen, 4:67-73) for this assay is expected 
to be greater than 0.7 (Z f = 1-(3cx + +3ct_ )/||a + -|i |) and the signal-to-background (S/B) 
ratio is expected to be between 30 and 300. 

[00285] Microarray and ALPH AScreen methods are advantageous in that 
they do not require additional processing steps such as phage separation, sub- 
cloning or purification. However, in certain instances the target: binder phage 
interaction may be disrupted too easily giving too many false positives. Conversely, 
target: binder phage interactions may be too difficult to disrupt and inhibitors may 
go undetected. Also, detection and quantitation of the interaction and the range of 
inhibition that can be detected may be limiting factors. In such cases, the ELISA 
methods may offer an advantage. ELISA method I requires separation of the target 
and binder phage while ELISA method II requires sub-cloning, expression and 
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purification of recombinant target protein. These methods provide cleaner more 
reproducible assays. 

EXAMPLE 23 

INFORMATICS FOR DATA MANAGEMENT AND ANALYSES 
[00286] In order to manage and analyze large volumes of data from proteome 
panning, guidelines for nomenclature and monitor information flow were 
established. 

1 . Easy To Use Naming System 

[00287] This system utilizes excel and visual basic macro to facilitate 
mapping samples in plates to meaningful names. It can help researchers to track 
their sample based on sample name, it also reduces the amount of work necessary 
for naming samples. 

2. Pre-Processing Of Raw DNA Sequence, End Trimming, Vector Trimming 
[00288] Raw sequencing data have several errors, especially at the two ends, 
which includes bad sequence due to sequencing machine's capability and vector 
sequences. Contamination has to be removed from the sequences before processing. 
Two criteria can be used, first, Phred program can be used for base calling based on 
sequence quality, secondly, if there is vector sequence contamination, trimming 
based on sequence characteristics will be used. 



WO 03/035839 



PCT/US02/34021 



- 123- 

3. Automated Blast Search With Pre-Defined Parameters Will Be Performed After 
Sequences Are Transferred To Linux Cluster 

[00289] Sequences were transferred through network, samba server 

running on the linux cluster support connection between Microsoft and linux 
machine. Sequences in one project were put under the same folder. GCG 
commands were called for sequence processing using pre-defined parameters. For 
nucleotide sequence, blastx program was used to dynamically translate query 
sequence in all 6 possible reading frames and compare the resulting protein 
sequence with SwissProt database. Results were stored in the same folder with 
name "file_name.blastx". 

4. Blast Results Parsed And Hits Information Entered Into Access Database 
[00290] Blast result parsers were developed that are capable of parsing blast 
results based on user defined parameters thereby generating reports automatically. 
For proteome panning, no parameter was needed, as all hits in the blast file were 
saved into database. Further query was performed in the database. By doing so, no 
information was lost during the first run of parsing. Sorthits, the main program used 
for proteome panning, generated three files automatically for each run: 

1 . hits.csv: contained all the parsed blast results, all properties were 
separated by comma, this file was imported to database for further 
analysis; 

2. hit.summary: contained each unique hit found in database, together with 
how many times they were found by query sequences, followed by the 
name of query sequence(s) that produced this hit. 
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3. hits.txt: contained information on each and every hit, a short description of 
their identity, start/end portion of query sequence and subject sequence. 
In addition to this, the name of the hit(s) was used to query a database of 
known house-keeping genes and reported whether any hit was one of the 
house keeping genes. 

5 . Database Can Be Queried To Generate Reports 

[00291] The hits.csv files generated by sorthits were imported directly into 
Access database and queried for specific hits. After parsing, the blast hits reported 
were separated into different fields that are relevant for analysis (listed in Table 2, 
example query shown in Fig. 7). 

[00292] The parser utilized the open source blast.pm as its main parse engine. 
Once the result was in the database, queries for any of the different fields, such as a 
specific sequence, organism, or E (expectation) value were easily performed. The 
results were exported in multiple formats to facilitate data sharing. 

6. Hits Information Generated And Verified By Querying Existing Protein -Protein 
Interaction Databases 

[00293] The automatic querying machinery was updated on a regular basis to 
accommodate the change in database design. To obtain protein-protein network 
information, the hits, especially those after verification, were used to query for all 
known protein-protein interaction network database for verification. 

[00294] For example, hit A from Homo sapiens was used to search yeast 
protein interaction network to retrieve the homolog's interaction network in yeast. 
The interaction network thus provided was used to search against the database for 
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potential hits. By iterating this process, a protein interaction network inferred from 
multiple species/organism was generated, stored, and queried for useful information. 

[00295] For hits that have no homologs, their interaction partners' (derived 
from our Phenogenix technology) homologs provided information about their 
potential function, cellular localization, etc. This feature of the database showed the 
power of this technology, not only can it verify known protein-protein interaction 
network, it is also capable of uncovering unknown protein-protein networks. 

[002961 The database was queried. All hits in the catalog were linked to their 
partners. Hits were also linked to various outside databases for their function 
description, map position in genome (if available). The search engine automatically 
downloaded this information to a local server for efficient data access during query. 



TABLE 2 



1 QUERYNAME 


Sequence identifier of the query 


2 QUERYLENGTH 


Full length of the query sequence 


3 SBJCT NAME 


Sequence identifier of the sbjct ("hit") 


4 SBJCT LENGTH 


Full length of the sbjct sequence 


5 EXPECT 


Expect value for the alignment 


6 SCORE 


Blast score for the alignment 


7 BITS 


Bit score for the alignment 


8 NUM_HSPS 


Number of HSPs (not the "N" value) 


9HSP FRAC IDENTICAL 


Fraction of identical substitutions 


10 HSP_FRAC_CONSERVED 


fraction of conserved ("positive") 
substitutions 


11 HSP_QUERY_ALN_LENGTH 


Length of the aligned portion of the query 
sequence 


12 HSP SBJCT ALN LENGTH 


Length of the aligned portion of the sbjct 
sequence 


13 HSP_QUERY_GAPS 


Number of gaps in the aligned query 
sequence 


14HSP SBJCT GAPS 


Number of gaps in the aligned sbjct 
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TABLE2 





sequence 


15 HSP_QUERY_START 


Starting coordinate of the query sequence 


16 HSP_QUERY_END 


Ending coordinate of the query sequence 


17HSP SBJCT START 


Starting coordinate of the sbjct sequence 


18HSP SBJCT END 


Ending coordinate of the sbjct sequence 


19 HSP_QUERY_STRAND 


Strand of the query sequence (TBLASTN/X 
only) 


20 HSP_SBJCT_STRAND 


Strand of the sbjct sequence (TBLASTN/X 
only) 


21 HSP_FRAME 


Frame for the sbjct translation 
(TBLASTN/X only) 


22 SBJCT_DESCRIPTION 
(optional) 


Full description of the sbjct sequence from 
the alignment 


23 Alignments 


Alignment of query sequence and sbjct 
sequence 



TABLE 3 



Clone # 


Lib 


Seauence 


E-Tag 


DGI-9 


LDH 


DGI9 
/LDH 


DGI9-20M-4-A4 14 


20F 


FLLQDFIMDAGSFVKPIGVG 


32.6 


8.7 


1.1 


8.1 


DGI9-20M-4-E3 


20F 


MAGFRGSGLSGYSEARLPRV 


34.5 


7.0 


1.0 


7.1 


DGI9-20M-4-A6 27 


20R 


WDWSRFGAQLRGAILGRATG 


27.2 


6.0 


0.9 


6.5 


DGI9-20M-4-A12 


20R 


SRLALSSQHGLSLVAGRRRG 


20.7 


1.9 


0.9 


2.1 



[00297] As various changes can be made in the above compositions and 
methods without departing from the scope and spirit of the invention, it is intended 
that all subject matter contained in the above description, shown in the 
accompanying drawings, or defined in the appended claims be interpreted as 
illustrative, and not in a limiting sense. 
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[00298] The contents of all patents, patent applications, published articles, 
books, reference manuals, texts and abstracts cited herein are hereby incorporated by 
reference in their entirety to more fully describe the state of the art to which the 
present invention pertains. 
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What is claimed is: 

1 . A method of selecting target and target binder pairs comprising: 

(a) mixing in a reaction vessel phage expressing biological targets and 
phage expressing target binders each having distinguishable selection 
markers; and 

(b) selecting target and target binder pairs based on the selection 
markers. 

2. A method of claim 1, further comprising, transforming host cells with the 
mixed phage of step (a). 

3. The method of claim 2, wherein the selectable marker is selected from a 
group consisting of: tetracycline, ampicillin, and kanamycin. 

4. The method of claim 2, wherein the host cell is selected from the group 
consisting of: bacteria, mammalian, yeast, insect, and plant cells. 

5. The method of claim 4, wherein the host cells are Escherichia coli. 

6. The method of claim 2, wherein the targets in step (a) comprise polypeptides 
encoded by a cDNA library. 

7. The method of claim 2, wherein the targets in step (a) comprise polypeptides 
encoded by a subtracted cDNA library. 

8. The method of claim 2, wherein the targets in step (a) comprise a 
polypeptide of known function. 

9. The method of claim 2, wherein the targets in step (a) comprise a 
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polypeptide of unknown function. 

1 0. The method of claim 2, wherein the targets in step (a) comprise a 
combination of polypeptides. 

1 1 . The method of claim 2, wherein the targets in step (a) comprise an antibody 
library. 

12. The method of claim 2, wherein the target binders in step (a) comprise a 
peptide library. 

13. The method of claim 2, wherein the target binders in step (a) comprise a 
random peptide library. 

14. The method of claim 2, wherein the target binders in step (a) comprise an 
antibody library. 

15. The method of claim 2, wherein the target binders in step (a) comprise 
polypeptides encoded by a cDNA library. 

16. The method of claim 2, wherein the target binders in step (a) comprise 
polypeptides encoded by a subtracted cDNA library. 

17. The method of claim 2, wherein the targets and target binders in step (a) 
comprise polypeptides encoded by a cDNA library. 

18. A cell population comprising a mixture of phage according to step (a) of 
claim 2. 

19. An isolated nucleic acid comprising a nucleotide sequence encoding an 



WO 03/035839 



PCT/US02/34021 



- 130- 

amino acid sequence selected from the group consisting of amino acid sequences of: 
SEQ ID NOs:l-189, SEQ ID NOs: 227-532, and SEQ ID NOs: 534-537. 

20. An isolated nucleic acid comprising a nucleotide sequence that is at least 
80% identical to the nucleotide sequence of claim 19. 

21 . An isolated nucleic acid comprising a nucleotide sequence that is 
complementary to the nucleotide sequence of claim 20. 

22. A vector comprising an isolated nucleic acid selected from the group 
consisting of: a nucleotide sequence encoding an amino acid sequence selected from 
the group consisting of amino acid sequences of: SEQ ID NOs: 1-189, SEQ ID NOs: 
227-532, and SEQ ID NOs: 534-537; a nucleotide sequence that is at least 80% 
identical to the nucleic acid; and a nucleotide sequence that is complementary to the 
nucleotide sequence that is at least 80% identical to the nucleic acid. 

23. A cell comprising the vector of claim 22, wherein the cell is selected from 
the group consisting of bacterial, yeast, insect, mammalian, and plant cells. 

24. A primer comprising the isolated nucleic acid of any one of claims 19-21. 

25. A probe comprising the isolated nucleic acid as in any one of claims 19, 20, 
or 21. 

26. An isolated peptide comprising an amino acid sequence selected from the 
group consisting of amino acid sequence of SEQ ID NOs: 1-1 89, SEQ ID NOs: 227- 
532, and SEQ ID NOs: 534-537. 
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27. The isolated peptide of claim 26, wherein said peptide is a target binder 
selected from the group consisting of: DGI-2, DGI-5, DGI-7, DGI-9, VEGF, 
VEGFR1, VEGFR2, VEGFR3, EGFR, FGFR1 -alpha, FGFRl-beta, and TieL 

28. An isolated peptide comprising an amino acid sequence that is at least 65% 
identical to the amino acid sequence of claim 26 and comprising biological activity 
associated with the peptide. 

29. An antibody that binds to the isolated peptide as in any one of claims 26, 27 
or 28. 

30. The antibody of claim 29 which is monoclonal. 

3 1 . The antibody of claim 29 which is polyclonal. 

32. A target binder library comprising a random peptide phage display library, 
further comprising peptides selected from the group consisting of SEQ ID NOs:l- 
189, SEQ ID NOs: 227-532, and SEQ ID NOs: 534-537. 

33. A method of identifying a target binder comprising screening a target binder 
library of claim 32 for amino acid sequences that bind to a target, wherein binding 
indicates identification of a target binder. 

34. A pharmaceutical composition comprising the vector of claim 22 and a 
physiologically acceptable carrier, excipient, or diluent. 

35. A pharmaceutical composition comprising the cell of claim 23 and a 
physiologically acceptable carrier, excipient, or diluent. 
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36. A pharmaceutical composition comprising the peptide as in claim 26 or 
claim 28 and a physiologically acceptable carrier, excipient, or diluent. 

37. A pharmaceutical composition comprising the antibody as in claim 29 and a 
physiologically acceptable carrier, excipient, or diluent. 

38. A method of treating cancer comprising: administering the pharmaceutical 
composition of claim 34 in an amount sufficient to treat the cancer. 

39. A method of treating cancer comprising: administering the pharmaceutical 
composition of claim 35 in an amount sufficient to treat the cancer. 

40. A method of treating cancer comprising: administering the pharmaceutical 
composition of claim 36 in an amount sufficient to treat the cancer. 

41. A method of treating cancer comprising: administering the pharmaceutical 
composition of claim 37 in an amount sufficient to treat the cancer. 

42. A method of identifying a biologically active compound comprising: 

(a) forming a complex with a target and a target binder identified by the 
method of claim 1; 

(b) incubating the formed complex with a test compound under 
conditions that allow the test compound to bind to the complex; and 

(c) screening for disruption of the polypeptide complex, wherein 
disruption of the complex indicates identification of a biologically 
active compound. 

43. The method of claim 42 further comprising: 
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(a) binding a target-specific solid support to the target, wherein the target 
specific solid support is conjugated with a target selection marker- 
specific label; 

(b) binding a target binder-specific solid support to the target binder, 
wherein the target binder-specific solid support is conjugated with a 
target binder selection marker-specific label; 

(c) forming a target: target binder complex. 

44. The method of claim 42, wherein the target binder is a peptide comprising an 

amino acid sequence selected from the group consisting of: SEQ ED NOs: 1-189, 
SEQ ID NOs: 227-532, and SEQ ID NOs: 534-537; DGI-2, DGI-5, DGI-7, DGI-9, 
VEGF, VEGFR1, VEGFR2, VEGFR3, EGFR, FGFR1 -alpha, FGFRl-beta, and 
Tiel; and an amino acid sequence that is at least 65% identical to the peptide and 
comprising biological activity associated with the peptide. 

45. The method of claim 43, wherein the target binder is a peptide comprising an 
amino acid sequence selected from the group consisting of: SEQ ID NOs: 1-189, 
SEQ ID NOs: 227-532, and SEQ ID NOs: 534-537; DGI-2, DGI-5, DGI-7, DGI-9, 
VEGF, VEGFR1, VEGFR2, VEGFR3, EGFR, FGFR1 -alpha, FGFRl-beta, and 
Tiel; and an amino acid sequence that is at least 65% identical to the peptide and 
comprising biological activity associated with the peptide. 

46. The method of claim 42, wherein the test compound is bound to a solid 
support. 

47. A method of identifying a biologically active compound comprising: 
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(a) incubating a target identified by the method of claim 1 with a test 
compound under conditions that allow the test compound to bind to 
target; 

(b) incubating the target and test compound with a target binder 
identified by the method of claim 1; and 

(c) screening for formation of a complex between the target and the 
target binder, wherein inhibition of formation of the complex 
indicates identification of a compound with biological activity. 

48. The method of claim 47, wherein the target binder is a peptide comprising an 

amino acid sequence selected from the group consisting of: SEQ ID NOs:l-189, 
SEQ ID NOs: 227-532, and SEQ ID NOs: 534-537; DGI-2, DGI-5, DGI-7, DGI-9, 
VEGF, VEGFR1, VEGFR2, VEGFR3, EGFR, FGFR1 -alpha, FGFRl-beta, and 
Tiel; and an amino acid sequence that is at least 65% identical to the peptide and 
comprising biological activity associated with the peptide. 

49. The method of claim 47, wherein the test compound is bound to a solid 
support. 

50. A method of constructing a cDNA phage display library, comprising: 

(a) obtaining a cDNA of interest; 

(b) cloning the cDNA into an expression screening vector specific for a 
selection marker to form a cDNA screening vector; 

(c) transforming the cDNA screening vector into host cells; 

(d) selecting clones containing the cDNA screening vector for the 
selection marker; 
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(e) digesting the selected cDNA screening vector with multiple enzymes 
in combination, thereby producing cDNA fragments; and 

(f) cloning the cDNA fragments of step (e) into a phage display vector, 
to construct a cDNA phage display library. 

5 1 . The method of claim 50, wherein the cDNA phage display library is 

displayed on Gene III. 

52. The method of claim 5 1 , wherein the cDNA phage display library is 
displayed on the N-terminus of Gene III. 

53 . A method of constructing a cDNA phage display library displayed on N- 
terminus of Gene III, comprising: 

(a) obtaining a cDNA of interest; 

(b) cloning the cDNA into a first expression screening vector specific for 
a first selection marker to form a first cDNA screening vector; 

(c) transforming the first cDNA screening vector into host cells; 

(d) selecting clones containing the first cDNA screening vector for the 
first selection marker; 

(e) cloning the selected cDNA from step (d) into a second expression 
screening vector specific for a second selection marker to form a 
second cDNA screening vector; 

(f) transforming the second cDNA screening vector into host cells; 

(g) selecting clones containing the second cDNA screening vector cDNA 
for the second selection marker; 

(h) digesting the selected second cDNA screening vector with multiple 
enzymes in combination, thereby producing cDNA fragments; and 
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(i) cloning the cDNA fragments of step (h) into a phage display vector, 
to construct a cDNA phage display library displayed on the N-. 

54. The method of claim 53, wherein the cDNA phage display library is 

displayed on Gene III. 

55. The method of claim 54, wherein the cDNA phage display library is 
displayed on the N-terminus of Gene III 

56. The method of claim 50, further comprising, cloning the cDNA fragments 
into each one of three possible reading frames of the expression screening vector. 

57. The method of claim 50 or claim 53, wherein the expression screening 
vector contains a unique sequence tag enabling the differentiation of peptide to 
facilitate sequencing. 

58. The method of claim 57, wherein the tag is selected from the group 
consisting of: 5' myc epitope, FLAG, HA, and any unique sequence. 

59. The method of claim 50 or claim 53, wherein the selection marker is selected 
from the group consisting of: green fluorescent protein, kanamycin resistance, 
ampicillin resistance, tetracycline resistance, 0-galactosidase, oxytrophic, and 
luciferase. 

60. The method of claim 53, wherein the first selection marker is kanamycin 
resistant and the second selection marker is a green fluorescent protein. 

61 . The method of claim 50 or claim 53, wherein the enzymes are selected from 
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the group consisting of: a) Narl, Nae, and Avr II; b) EcoRI, Hindlll, and BbvC I; c) 
Bam HI, Xba I, and Ssp I; d) Bgl n, Bgl I, and Xhol; e) Sac I, Sph I, andDra I; f) 
Rsa I; g) Narl , Eco RI, and Avr II; h) EcoRI, Hind III, and Xho I; i) Bam HI, Xbal, 
and Avr II; and j) Bgl II, EcoRI, and Xhol. 
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FIG, 3 



CCGGCCATGGCGATGGTGCGGACTAAAGCAGACAGTGTTCCAGGCACTTACAGA 

AAAGTGGTGGCTGCTCGAGCCCCCAGAAAGGTGCTTGGTTCTTCCACCTCTGCCA 

CTAATTCGACATCAGTTTCATCGAGGAAAGCTGAAAATAAATATGCAGGAGGGA 

ACCCCGTTTGCGTGCGCCCAACTCCCAAGTGGCAAAAAGGAATTGGAGAATTCTT 

TAGGTTGTCCCCTAAAGATTCTGAAAAAGAGAATCAGATTCCTGAAGAGGCAGG 

AAGCAGTGGCTTAGGAAAAGCAAAGAGAAAAGCATGTCCTTTGCAACCTGATCA 

CACAAATGATGAAAAAGAAGCGGCCGCAGGTGCGCCGGTGCCGTATCCGGATCC 

GCTGGAACCGCGTGCCGCATAG 
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DESCRIPTION 


Q9ui50 homo sapiens (human). 
pro0657 (fragment). 5/2000 


Q9ha67 homo sapiens (human), 
cdnaflj 12 155 fis 


Q9h5r3 homo sapiens (human), 
cdna: flj23 147 fis 


Q9h965 homo sapiens (human), 
cdnaflj 12983 fis 


Q9h700 homo sapiens (human). 
cdna:flj21617fis 


Q9hcl6 homo sapiens (human). 
kiaal556 protein (fragment). 
6/2001 


Q9nx85 homo sapiens (human), 
cdna flj20378 fis 


Q9hcn6 homo sapiens (human), 
platelet glycoprotein vi-3. 6/2001 


Q9nr08 homo sapiens (human), 
ubiquitous tpr-motif protein y 
isoform (fragment). 10/2000 


Q9pl47 homo sapiens (human). 
pro2822. 10/2000 


Q9h842 homo sapiens (human), 
cdnaflj 13954 fis 


Q9pl95 homo sapiens (human). 
prol722. 10/2000 
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Phenogenix 
Partner Protein 
Acc. No, 


1015283. 1 


015438. 


099418. 


043739. 


O9UIA0 


1075962 I 




1013972 1 












Phenogenix 
Partner ID 


RAS-GAP2, 

Cytohesin 

1,2,3,4 


TRIO (Triple 
functional 
domain 
protein) 


GNRP 

(Guanine 

nucleotide 

releasing 

nrotein^ 










HSPL Seq 


TIRRWFCLAHW 
GTEGCLART 


EWARCWRWL 

VEGWGQLGT 

VCGSS 


GLGWRDPVCV 
PFRGMRLCLV 






GCXXFXGFCV 


FRXWVEGf 


Target 
Locus 


i9837 




19837 




,9837 1 




10403 1 




No 
Entry 


9768 


9768 


Target 
Protein Acc. 
No. 


INP 066545.1 1 




INP 066545.1 1 




INP 066545.1 1 




INP 006092.1 1 




IAAA65590.1 1 




IAAH07101.1 1 




IAAH07101.1 1 




Target 
DNA Acc. 
No. 


INM 02106 1 




INM 02106 1 




INM 02106 1 




INM 00610 1 


— i 


IU03877 




IBC007101 1 




IBC00710I i 




METHOD 


Biopanning 


Biopanning 


Biopanning 


Biopanning 


Biopanning 


Biopanning 


Biopanning 


Clone Name 


dgen-0101- 
20M-3-AP- 
A158 


dgen-0101- 
20M-3-AP- 
C511 


dgen-0101- 
20M-3-AP- 
G32 


No Entry 




dgen-0104- 
20M-3-KH- 
Motitt 


dgen-0104- 
20M-3-KH- 
Motif2 


DGI 

Alias/ 

Target 

JUL/ 


DGI-3 


DGI-3 


DGI-3 


DGI-9 


DGI- 11 


DGI-2 


DGI-2 


DGI# 


dgen- 
0101 


dgen- 
0101 


dgen- 
0101 


dgen- 
0102 


dgen- 
0103 


dgen- 
0104 


dgen- 
0104 
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oo 
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Phenogenix 
Partner Protein 
Acc, No. 












Phenogenix 
Partner ID 




No 

significant 
hits 








HSPL Seq 


SXGWXFPGWR 


IGROLGGGVA 
MDCDVAVGER 








Target 
Locus 


9768 


11164 




No 
Entry 


9168 


CM 

as] 


Target 
Protein Acc. 
No. 


AAH07101.1 




AAF25479.1 1 




BAA89054.1 1 




AAB25225.1 1 




AAH0873.7.1 1 




Target 
DNA Acc. 
No. 


BC007101 




AF218818 




AB030951 ! 




S54005 1 




BC008737 1 




METHOD 


Biopanning 


Proteome 
Panning 


Proteome 
Panning 


Proteome 
Panning 


Proteome 
Panning 


Clone Name 


dgen-0104- 
20M-3-KH- 
Motif3 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-B11 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-G1 1 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-G5 


072402- 
cDNATet- 
20M-PP-DC- 
F3 


DGI 
Alias / 
Target 

IX/ 


DGI-2 


NUDT5 

(nudix 

hydrolase) 


TNFR2 


thymosin 
beta- 10 


Solute 


carrier 

family 25 

(mitochon 

drial 

adenine 

nucleotide 

translocat 

or) 


DGI# 


dgen- 
0104 


dgen- 
0105 


dgen- 
0106 


dgen- 
0107 


dgen- 
0108 
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oo 
O 



Phenogenix 
Partner Protein 
Acc. No. 


IP98161 1 




I09Y2R5 1 




IP25120 1 








Phenogenix 
Partner ID 

: 


PKDl 

(Polycystin 

precursor) 


28S 

ribosomal 
protein SI 7 


60S 

ribosomal 
protein L8 




No 

significant 
hits 


HSPL Seq 


SKRLCAAGYQ 
DLNVQRWRAL 
RLGPMA 


QNWWTHRWV 
GDIVLRFGTTR 


QRTGRLCNRGT 
ARFASSVGL 


PACRRFSRRLA 
PSRDRGSGP 


QRVSCSPLEHT 
YGGLYGDMGA 
SWRG 


Target 
Locus 


12483 I 




16224 




16224 1 




6224 


9896 


Target 
Protein Acc. 
No. 


1014331 




IAAH07507.1 




IAAH07507.1 1 




AAH07507.1 1 




092562 ! 




Target 
DNA Acc. 
No. 


IL76159 




IBC007507 1 




IBC007507 1 




■BC007507 1 




D87464 1 




METHOD 


Proteome 
Panning 


Proteome 
Panning 


Proteome 
Panning 


Proteome 
Panning 


Proteome 
Panning 


Clone Name 


072402- 
cDNATet- 
20M-PP-DC- 
clone 25 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-B10 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-E1 1 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-F2 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-C12 


DGI 

Alias / 
Target 
ID 


FRGl 


Ribosoma 
1 Protein 
S20 


Ribosoma 
1 Protein 
S20 


Ribosoma 
1 Protein 
S20 


SAC3 

(Sac 

domain 

Ann to i n i n ct 

inositol 

phosphata 

se3) 


DGI# 


dgen- 
0109 


dgen- 
0110 
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0110 


dgen- 
0110 
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0111 
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O 

E 



Phenogenix 
Partner Protein 
Acc. No. 






043490 1 




Phenogenix 
Partner ID 


No 

significant 
hits 


No 

significant 
hits 


PROMLl 
(Prominin- 
like protein) 


HSPLSeq 


TLNSCLSGVIA 
C#CRLSGITIH 


LTGOCNWARD 
PCVVGVTMW 
RRASG 


QVILCCVPGLG 
LVFMRDALGV 
RAGH 


Target 
Locus 


9896 ! 




9896 | 




9896 


Target 
Protein Acc. 
No. 


1092562 1 




1092562 1 




1092562 i 




Target 
DNA Acc. 
No. 


ID87464 ] 




ID87464 1 




ID87464 I 




METHOD 


Proteome 
Panning 


Proteome 
Panning 


Proteome 
Panning 


Clone Name 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-D10 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-D2 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-F9 
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Alias / 
Target 
ID 


SAC3 

(Sac 

domain 

inositol 
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se3) 


SAC3 

(Sac 

domain 

contain in q 
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phosphata 

se3) 


SAC3 

(Sac 

domain 

UUllLalll 111^, 

inositol 

phosphata 

se3) 


DGI# 


dgen- 
0111 
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0111 



WO 03/035839 



PCT/US02/34021 



23/34 



f*3 
oo 



Phenogenix 
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Acc. No. 




1013515 1 








Phenogenix 
Partner ID 


No 

significant 
hits 


Phakinin 






HSPL Seq 


WI#AELFGYAG 
LDMLHVMSGF 
SEVACLLTVPN 
OS 


SFLRASSVSLRS 
SLVTLPRVSGV 
L 


SGWPVGGGPV 
CIALRLAGRR 




Target 
Locus 


19896 1 




9896 


1340 


No 
Entry 


Target 
Protein Acc. 
No. 


1092562 




092562 




IAAH02478.1 1 




No Entry 


Target 
DNA Acc. 
No. 


ID87464 j 




ID87464 j 




BC002478 1 




IAK024337 1 




METHOD 


Proteome 
Panning 


Proteome 
Panning 


Proteome 
Panning 


Proteome 
Panning 


Clone Name 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-G9 


No Entry 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-C2 


072402- 
cDNATet- 
20M-PP-DC- 
D12 


DGI 
Alias/ 
Target 
ID 


SAC3 

(Sac 

domain 

containing 

inositol 

phosphata 

se 3) 


SAC3 

(Sac 

domain 

containing 

inositol 
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oxidase 

bUUUllll 

VIB 


Similar to 

CGI-16 

(unknown 

lUIlLllUIl^ 


DGI# 


dgen- 
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dgen- 
0111 
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0113 
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Phenogenix 
Partner Protein 
Acc. No. 
















Phenogenix 
Partner ID 






No 

significant 
hits 










HSPL Seq 




RTGFCCSAGCQ 

VRVLTRPEGTI 

GMV 


YAVGVQPAIRV 
GVNLDVHMS 


MTVHRLIFTGD 
KGRAYYSVP 


GWRLFLTGVLG 
GSKNGHASL 


WGKTAGECGT 
LLWOMGRAWF 


LGFGSGAHPTT 
REDWIELFR 


Target 
Locus 


No 
Entry 


No 
Entry 


No 
Entry 


179170 1 




14500 i 




7422 


12321 ! 




Target 
Protein Acc. 
No. 


No Entry 


IBAA03400.1 1 




IBAA00469.1 1 




INP 077296.1 1 




ICAA54 136.1 1 




INP 003367.2 1 




INP 002010 1 




Target 
DNA Acc. 
No. 


IAK000548 1 




ID14530 i 




ID00591 ! 




|NM 02432 1 


51 


1X76717 i 




INM 00337 1 




INM 00201 1 


CM 


METHOD 


Proteome 
Panning 


Proteome 
Panning 


Proteome 
Panning 


Biopanning 


Biopanning 


Biopanning 


Biopanning 


Clone Name 


072402- 
cDNATet- 
20M-PP-DC- 
H2 


073002- 
cDNATet- 
20F20C40R- 
PP-DC-C7 


011002- 
OVcDNATet- 
20F-PP-DC- 
Dl 


DGI5-20M-4- 
D2 


i 
i 

S 

o 

i 

Q O 


VEGF-20F-4- 
F8 42 


tin 

o 

i 

S s 
o ^ 

> 


DGI 
Alias / 
Target 
ID 


cDNA 
FLJ20541 
(unknown 
function) 


Human 

homolog 

of yeast 

rihosomal 

protein 

S28 


RCCl 


DGI-5 


DGI-7 


VEGF 


VEGF-R1 
/FLTl 


DGI# 


dgen- 
0114 


dgen- 
0115 


dgen- 
0116 


dgen- 
0117 


dgen- 
0118 
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dgen- 
0120 
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Phenogenix 
Partner Protein 
Acc. No. 


















Phenogenix 
Partner ID 


















HSPL Seq 


FLFPLHFLSPWS 
VGTSHSRI 


FMSPYSLMLRH 
GLMGERQRG 


AVSRLLGDTQR 
LIGYLQEPL 






RRVDAGGAW 
YLDRWGNVSV 


LRQWCRWVM 
GASRRMPFICA 
GWDGP 


LSRPVIPWTSGE 
IGRATRQD 


Target 
Locus 


13791 ! 




12324 1 




11956 1 








17075 ! 




VOl 

m 
oo| 


2:1 


Target 
Protein Acc. 
No. 


IAAB88005.1 1 




INP 002011 ! 




INP 005219 








INP 005415.1 1 




INP 004337 1 




'NP 001609 1 




Target 
DNA Acc. 
No. 


IAF035121 1 




INM 00202 1 




INM 00522 1 


oo| 






INM 00542 1 




INM 00434 1 


vol 


NM 00161 1 


col 


METHOD 


Biopanning 


Biopanning 


Biopanning 






Biopanning 


Biopanning 


Biopanning 


Clone Name 
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